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COMPOSITIONS AND METHODS FOR HYDROXYLATING 



EPOTHILONES 



10 Field of the Invention 

The present invention relates to isolated nucleic acids sequences and 
polypeptides encoded thereby for epothilone B hydroxylase and mutants and variants 
thereof, and a ferredoxin located downstream from the epothilone B hydroxylase 
gene. The present invention also relates to recombinant microorganisms expressing 

15 epothilone B hydroxylase or a mutant or variant thereof and/or ferredoxin which are 
capable of hydroxylating small organic molecule compounds, such as epothilones, 
having a terminal alkyl group to produce compounds having a terminal hydroxyalkyl 
group. Also provided are methods for recombinantly producing such microorganisms 
as well as methods for using these recombinant microorganisms in the synthesis of 

20 compounds having a terminal hydroxylalkyl group. The compositions and methods 
of the present invention are useful in preparation of epothilones having a variety of 
utilities in the pharmaceutical field. A novel epothilone analog produced using a 
mutant of epothilone B hydroxylase of the present invention is also described. 

25 Background of the Invention 

Epothilones are macrolide compounds that find utility in the pharmaceutical 
field. For example, epothilones A and B having the structures: 




R 



O 



OH O 



30 



Epothilone A 
Epothilone B 



R=:H 



R=Me 



have been found to exert microtubule-stabiHzing effects similar to paclitaxel 
(TAXOL®) and hence cytotoxic activity against rapidly proliferating cells, such as. 



wo 03/057830 



PCT/US02/40359 



5 tumor cells or cells associated with other hyperproliferative cellular diseases, see 
BoUag et aL, Cancer Res ., Vol. 55, No. 11, 2325-2333 (1995). 

Epothilones A and B are natural anticancer agents produced by Sorangium 
cellulosum that were first isolated and characterized by Hofle et al., DE 4138042; WO 
93/10121; Angew. Chem. Int. Ed. Engl . Vol. 35, Nol3/14, 1567-1569 (1996); and L 

10 Antibiot .. Vol. 49, No. 6, 560-563 (1996). Subsequently, the total syntheses of 
epothilones A and B have been published by Balog et aL, Angew. Chem. Int. Ed. 
Engl .. Vol. 35, No. 23/24, 2801-2803, 1996; Meng et al, J. Am. Chem. Soc , Vol. 
119, No. 42, 10073-10092 (1997); Nicolaou et aL J, Am. Chem. Soc .. Vol. 119, No. 
34, 7974-7991 (1997); Schinzer a/., Angew. Chem. Int. Ed. Eng .. Vol. 36, No. 5, 

15 523-524 (1997); and Yang et ai, Angew. Chem. Int. Ed. Engl. . Vol. 36, No. 1 / 2, 
166-168, 1997. WO 98/25929 disclosed the methods for chemical synthesis of 
epothilone A, epothilone B, analogs of epothilone and libraries of epothilone analogs. 
The structure and production from Sorangium cellulosum DSM 6773 of epothilones 
C, D, E, and F was disclosed in WO 98/22461. Figure 1 provides a diagram of the 

20 biotransformation as described in WO 00/39276 of epothilone B to epothilone F in 
Actinomycetes species strain SCI 5847 (ATCC PT-1043), subsequently identified as 
Amycolatopsis orientalis. 

Cytochrome P450 enzymes are found in prokaryotes and eukaryotic cells and 
have in common a heme binding domain which can be distinguished by an 

25 absorbance peak at 450 nm when complexed with carbon monoxide. Cytochrome 
P450 enzymes perform a broad spectrum of oxidative reactions on primarily 
hydrophobic substrates including aromatic and benzylic rings, and alkanes. In 
prokaryotes they are found as detoxifying systems and as a first enzymatic step in 
metabolizing substrates such as toluene, benzene and camphor. Cytochrome P450 

30 genes have also been found in biosynthetic pathways of secondary metabolites such as 
nikkomycin in Streptomyces tendae (Bruntner, C. et al, 1999, Mol. Gen. Genet. 262: 
102-114), doxorubicin (Dickens, M.L, Strohl, W.R., 1996, J. Bacteriol, 178: 3389- 
3395) and in the epothilone biosynthetic cluster of Sorangium cellulosum (Julien, B. 
et al., 2000, Gene, 249: 153-160). With a few exceptions, the cytochrome P450 

35 systems in prokaryotes are composed of three proteins; a ferredoxin NADH or 
NADPH dependent reductase, an iron-sulfur ferredoxin and the cytochrome P450 
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5 enzyme (Lewis, D.R, Hlavica, P., 2000, Biochim. Biophys. Acta., 1460: 353-374). 
Electrons are transferred from ferredoxin reductase to the ferredoxin and finally to the 
cytochrome P450 enzyme for the splitting of molecular oxygen. 

Summary of the Invention 

10 An object of the present invention is to provide isolated nucleic acid sequences 

encoding epothilone B hydroxylase and variants or mutants thereof and isolated 
nucleic acid sequences encoding ferredoxin or variants or mutants thereof. 

Another object of the present invention is to provide isolated polypeptides 
comprising amino acid sequences of epothilone B hydroxylase and variants or 
15 mutants thereof and isolated polypeptides comprising amino acid sequences of 
ferredoxin and variants or mutants thereof. 

Another object of the present invention is to provide structure coordinates of 
the homology model of the epothilone B hydroxylase. The structure coordinates are 
listed in Appendix 1. This model of the present invention provides a means for 
20 designing modulators of a biological function of epothilone B hydroxylase as well as 
additional mutants of epothilone B hydroxylase with altered specificities. 

Another object of the present invention is to provide vectors comprising 
nucleic acid sequences encoding epothilone B hydroxylase or a variant or mutant 
thereof and/or ferredoxin or a variant or mutant thereof. In a preferred embodiment, 
25 these vectors further comprise a nucleic acid sequence encoding ferredoxin. 

Another object of the present invention is to provide host cells comprising a 
vector containing a nucleic acid sequence encoding epothilone B hydroxylase or a 
variant or mutant thereof and/or ferredoxin or a variant or mutant thereof. 

Another object of the present invention is to provide a method for producing 
30 recombinant microorganisms that are capable of hydroxylating compounds, and in 
particular epothilones, having a terminal alkyl group to produce compounds having a 
terminal hydroxyalkyl group. 

Another object of the present invention is to provide microorganisms produced 
recombinantly which are capable of hydroxylating compounds, and in particular 
35 epothilones, having a terminal alkyl group to produce compounds having a terminal 
hydroxyalkyl group. 
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5 Another object of the present invention is to provide methods for 

hydroxylating compounds in these recombinant microorganisms. In particular, the 
present invention provides a method for the preparation of hydroxyalkyl-bearing 
epothilones, which compounds find utility as antitumor agents and as starting 
materials in the preparation of other epothilone analogs. 

10 Yet another object of the present invention is to provide a compound of 

Formula A: 




referred to herein as 24-OH epothilone B or 24-OH EpoB, as well as compositions 
15 and methods for production of compositions comprising the compound of Formula A. 

Brief Description of the Figures 

Figure 1 provides a schematic of the biotransformation as set forth in WO 
00/39276, U.S. Application Serial No. 09/468,854, filed December 21. 1999, of 
20 epothilone B to epothilone F by Amycolatopsis orientalis strain SCI 5847 (PTA1043). 

Figure 2 shows the nucleic acid sequence alignments of SEQ ID N0:5 through 
SEQ ID NO:22 used to design the PGR primers for cloning of the nucleic acid 
sequence encoding epothilone B hydroxylase. 

Figure 3 shows the sequence alignment between epothilone B hydroxylase 
25 (SEQ ID N0:2) and EryF (PDB code IJIN chain A; SEQ ID NO:76). The asterisks 
indicate sequence identities, the colons (:) similar residues. 

Figure 4 provides a homology model of epothilone B hydroxylase based upon 
sequence alignment with EryF as shown in Figure 3. 

Figure 5 shows an energy plot of the epothilone B hydroxylase model 
30 (indicated by dashed line) relative to EryF (PDB code IJIN; indicated by solid line). 
An averaging window size of 51 residues was used, i.e., the energy at a given residue 



-4- 
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5 position is calculated as the average of the energies of the 51 residues in the sequence 
that lie with the given residue at the central positions. 

Detailed Description of the Invention 



polypeptides and methods for obtaining compounds with desired substituents at a 
terminal carbon position. In particular, the present invention provides compositions 
and methods for the preparation of hydroxyalkyl-bearing epothilones, which 
compounds find utility as antitumor agents and as starting materials in the preparation 
1 5 of other epothilone analogs. 

The term "epothilone," as used herein, denotes compounds containing an 
epothilone core and a side chain group as defined herein. The term "epothilone core/' 
as used herein, denotes a moiety containing the core structure (with the numbering of 
ring system positions used herein shown): 



10 



The present invention relates to isolated nucleic acid sequences and 




20 



o 



X 



wherein the substituents are as follows: 



Q is selected from the group consisting of 




and 



25 



WisOor NRe; 



X is selected from the group consisting of O, H and OR?; 
M is O, S, NRg, CR9R10; 

Bi and B2 are selected from the group consisting of ORi i, OCOR12; 
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5 R1-R5 and R12-R17 are selected from the group consisting of H, alkyl, 

substituted alkyl, aryl, and heterocyclo, and wherein Ri and R2 are alkyl they can be 
joined to form a cycloalkyl; 

Re is selected from the group consisting of H, alkyl, and substituted alkyl; 
R7 and Ri i are selected from the group consisting of H, alkyl, substituted 
10 alkyl, trialkylsilyl, alkyldiarylsilyl and dialkylarylsilyl; 

Rg is selected from the group consisting of H, alkyl, substituted alkyl. Ri3C=0, 
Ri40C=0 and R15SO2; and 

R9 and Rio are selected from the group consisting of H, halogen, alkyl, 
substituted alkyl, aryl, heterocyclo, hydroxy, Ri6C=0, and Ri70C=0. 
15 The term "side chain group" refers to substituent G as defined above for 

Epothilone A or B or Gi and G2 as shown below. 
Gi is the following formula V 

HO-CH2-(Ai)n-(Q)„.-(A2)o (V), 

and 

20 G2 is the following formula VI 

CH3-(A,)n-(Q)m-(A2)o (VI), 

where 

A I and A2 are independently selected from the group of optionally substituted 
Ci-Cs alkyl and alkenyl; 
25 Q is optionally substituted ring system containing one to three rings and at 

least one carbon to carbon double bond in at least one ring; and 

n, m, and 0 are integers independently selected from the group consisting of 
zero and 1, where at least one of m, n or 0 is 1. 

The term "terminal carbon" or "terminal alkyl group" refers to the terminal 
30 carbon or terminal methyl group of the moiety either directly bonded to the epothilone 
core at position 15 or to the terminal carbon or terminal alkyl group of the side chain 
group bonded at position 15. It is understood that the term "alkyl group" includes 
alkyl and substituted alkyl as defined herein. 

The term "alkyl" refers to optionally substituted, straight or branched chain 
35 saturated hydrocarbon groups of 1 to 20 carbon atoms, preferably 1 to 7 carbon atoms. 
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5 The expression "lower alkyf* refers to optionally substituted alkyl groups of 1 to 4 
carbon atoms. 

The term "substituted alkyl" refers to an alkyl group substituted by, for 
example, one to four substituents, such as, halo, trifluoromethyl, trifluoromethoxy, 
hydroxy, alkoxy, cycloalkyloxy. heterocyclooxy, oxo, alkanoyl, aryloxy, alkanoyloxy, 

10 amino, alkylamino, arylamino, aralkylamino, cycloalkylamino, heterocycloamino, 
disubstituted amines in which the 2 amino substituents are selected from alkyl, aryl or 
aralkyl, aikanoylamino, aroylamino, aralkanoylamino, substituted alkanoylamino, 
substituted arylamino, substituted aralkanoylamino, thiol, alkylthio, arylthio, 
aralkylthio, cycloalkylthio, heterocyclothio, alkylthiono, arylthiono, aralkylthiono, 

15 alkylsulfonyl, arylsulfonyl, aralkylsulfonyl, sulfonamide (e.g. SO2NH2), substituted 
sulfonamido, nitro, cyano, carboxy, carbamyl (e.g. CONH2), substituted carbamyl 
(e.g. CONH alkyl, CONH aryl, CONH aralkyl or cases where there are two 
substituents on the nitrogen selected from alkyl, aryl or aralkyl), alkoxycarbonyl, aryl, 
substituted aryl, guanidino and heterocyclos, such as, indolyl, imidazolyl, furyl, 

20 thienyl, thiazolyl, pyrrolidyl, pyridyl, pyrimidyl and the like. Where noted above 
where the substituent is further substituted it will be with halogen, alkyl, alkoxy, aryl 
or aralkyl. 

In accordance with one aspect of the present invention there are provided 
isolated polynucleotides that encode epothilone B hydroxylase, an enzyme capable of 
25 hydroxylating epothilones having a terminal alkyl group to produce epothilones 
having a terminal hydroxyalkyl group. 

In accordance with another aspect of the present invention there are provided 
isolated polynucleotides that encode a ferredoxin, the gene for which is located 
downstream from the epothilone B hydroxylase gene. Ferredoxin is a protein of the 
30 cytochrome P450 system. 

By "polynucleotides", as used herein, it is meant to include any form of DNA 
or RNA such as cDNA or genomic DNA or mRNA, respectively, encoding these 
enzymes or an active fragment thereof which are obtained by cloning or produced 
synthetically by well known chemical techniques. DNA may be double- or single- 
35 stranded. Single-stranded DNA may comprise the coding or sense strand or the non- 
coding or antisense strand. Thus, the term polynucleotide also includes 
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5 polynucleotides exhibiting at least 60% or more, preferably at least 80%, homology to 
sequences disclosed herein, and which hybridize under stringent conditions to the 
above-described polynucleotides. As used herein, the term "stringent conditions" 
means hybridization conditions of 60°C at 2xSSC buffer. More preferred are isolated 
nucleic acid molecules capable of hybridizing to the nucleic acid sequence set forth in 

10 1, 30, 32, 34, 36, 37, 38, 39, 40, 41, 42, 60, 62, 64, 66, 68, 70, 72, or 74 or SEQ ID 
N0:3, or to the complementary sequence of the nucleic acid sequence set forth in 
SEQ ID N0:1, 30, 32, 34, 36, 37, 38, 39, 40, 41, 42, 60, 62 ,64, 66, 68, 70, 72 ,or 74 
or SEQ ID N0:3, under hybridization conditions of 3X SSC at 65'^C for 16 hours, 
and which are capable of remaining hybridized to the nucleic acid sequence set forth 

15 in SEQ ID N0:1, 30, 32, 34, 36, 37, 38, 39, 40, 41, 42, 60, 62, 64, 66, 68, 70, 72 or 74 
or SEQ ID N0:3, or to the complementary sequence of the nucleic acid sequence set 
forth in SEQ ID N0:1, 30, 32, 34, 36, 37, 38, 39. 40, 41 or 42, 60, 62, 64, 66, 68, 70, 
72 or 74 or SEQ ID NO: 3, under wash conditions of 0.5X SSC, 55°C for 30 minutes. 
In one embodiment, a polynucleotide of the present invention comprises the 

20 genomic DNA depicted in SEQ ID NO: 1 or a homologous sequence or fragment 
thereof which encodes a polypeptide having similar activity to that of this epothilone 
B hydroxylase. Alternatively, a polynucleotide of the present invention may comprise 
the genomic DNA depicted in SEQ ID N0:3 or a homologous sequence or fragment 
thereof which encodes a polypeptide having similar activity to this ferredoxin. Due to 

25 the degeneracy of the genetic code, polynucleotides of the present invention may also 
comprise other nucleic acid sequences encoding this enzyme and derivatives, variants 
or active fragments thereof. 

The present invention also relates to variants of these polynucleotides which 
may be naturally occurring, i.e., present in microorganisms such as Amycolatopsis 

30 orientalis and Amycolata autotrophica, or in soil or other sources from which nucleic 
acids can be isolated, or mutants prepared by well known mutagenesis techniques. 
Exemplary variants polynucleotides of the present invention are depicted in SEQ ID 
NO: 36-42. 

By "mutants" as used herein it is meant to be inclusive of nucleic acid 
35 sequences with one or more point mutations, or deletions or additions of nucleic acids 
as compared to SEQ ID NO: 1 or 3, but which still encode a polypeptide or fragment 



-8- 
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5 with similar activity to the polypeptides encoded by SEQ ID NO: 1 or 3. In a 

preferred embodiment, mutations are made which alter the substrate specificity and/or 
yield of the enzyme. A preferred region of mutation with respect to the epothiione B 
hydroxylase gene is that region of the nucleic acid sequence coding for the 
approximately 1 13 amino acids residues comprising the active site of the enzyme. 
10 Also preferred are mutants encoding a polypeptide with at least one amino acid 
substitution at amino acid position GLU31, ARG67, ARG88, rLE92, ALA93, 
VAL106, ILE130, ALAMO, MET176, PHE190, GLU 231, SER294, PHE237, or 
ILE365 of SEQ ID NO: 1 . Exemplary polynucleotide mutants of the present invention 
are depicted in SEQ ID NO: 30, 32, 34, 60, 62, 64, 66, 68, 70, 72 and 74. 
15 Cloning of the nucleic acid sequence of SEQ ID NO: 1 encoding epothiione B 

hydroxylase was performed using PGR primers designed by aligning the nucleic acid 
sequences of six cytochrome P450 genes from bacteria. The following cytochrome 
P450 genes were aligned: 

Sequence 1: Locus: STMSUACB; Accession number: M32238; Reference: 
20 Omer, C.A., J. Bacteriol. 172: 3335-3345 (1990) 

Sequence 2: Locus: STMSUBCB; Accession number: M32239; Reference: 

Omer, C.A.. J. Bacteriol. 172: 3335-3345 (1990) 
Sequence 3: Locus: ABO 18074 (formerly STMORFA); Accession number: 

AB018074; Reference: Ueda, K., J. Antibiot. 48: 638-646 (1995) 
25 Sequence 4: Locus: SSU65940; Accession number: U65940; Reference: 
Motamedi, H., J. Bacteriol. 178: 5243-5248 (1996) 
Sequence 5: Locus: STMOLEP; Accession number: L37200; Reference: 

Rodriguez, A.M., FEMS Microbiol. Lett. 127: 117-120 (1995) 
Sequence 6: Locus: SERCP450A; Accession number: M83110; Reference: 
30 Andersen, J.F. and Hutchinson, C.R., J. Bacteriol. 174: 725-735 

(1992) 

Alignments were performed using an implementation of the algorithm of 
Myers, E.W. and W. Miller. 1988. CABIOS 4:1, 11-17., the Align program from 
Scientific and Educational Software (Durham, North Carolina, USA). Three highly 
conserved regions were identified in the I-helix, containing the oxygen binding 
domain, in the K-helix, and spanning the B-bulge and L-helix containing the 
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5 conserved heme binding domain. Primers were designed to the three conserved 
regions identified in the alignment. Primers P450-r (SEQ ID NO:23) and P450-la'" 
(SEQ ID NO:24) were designed from the I helix, Primer P450-2'' (SEQ K) NO:25) 
was designed from the B-Bulge and L-helix region and Primer P450-3"( SEQ ID 
NO:27) was designed as the reverse complement to the heme binding protein. 

10 Genomic fragments were then amplified via polymerase chain reaction (PGR). 

After PGR amplification, the reaction products were separated by gel electrophoresis 
and fragments of the expected size were excised. The DNA was extracted from the 
agarose gel slices using the Qiaquick gel extraction procedure (Qiagen. Santa Glarita, 
Galifomia, USA). The fragments were then cloned into the PCRscript vector 

15 (Stratagene, La JoUa, California, USA) using the PGRscript Amp cloning kit 

(Stratagene), Colonies containing inserts were picked to 1-2 ml of LB broth with 100 
|ag/ml ampicillin, 30-37^G, 16-24 hours, 230-300 rpm. Plasmid isolation was 
performed using the Mo Bio miniplasmid prep kit (Mo Bio, Solano Beach, Califomia, 
USA). This plasmid DNA was used as a PGR and sequencing template and for 

20 restriction digest analysis. 

The cloned PGR products were sequenced using the Big-Dye sequencing kit 
from Applied Biosystems, (Foster City, Galifomia, USA) and were analyzed using the 
ABI310 sequencer (Applied Biosystems, Foster City, Califomia, USA). The sequence 
of the inserts was used to perform a TblastX search, using the protocol of Altschul, 

25 S.F, et a/., Mol. Biol. 215:403-410 (1990), of the non-redundant protein database. 
Unique sequences having a significant similarity to known cytochrome P450 proteins 
were retained. Using this approach, a total of nine different P450 sequences were 
identified from SC15847, seven from the genomic DNA template and two from the 
cDNA. Two P450 sequences were found in common between the DNA and cDNA 

30 templates. Of the fifty cDNA clones analyzed, two sequences were predominant, 
with twenty clones each. These two genes were then cloned from the genomic DNA. 

The nucleic acid sequence of the genomic DNA was determined using the 
Big-Dye sequencing system (Applied Biosystems) and analyzed using an ABI310 
sequencer. This sequence is depicted in SEQ ID NO: 1. An open reading frame 

35 coding for a protein of 404 amino acids and a predicted molecular weight of 44.7 kDa 
was found within the cloned Bglll fragment. The deduced amino acid sequence of 
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5 this polypeptide is depicted in SEQ ID NO: 2. The amino acid sequence of this 
polypeptide was found to share 51% identity with the NikF protein of Streptomyces 
tendae (Bruntner, C. et al, 1999, Mol. Gen. Genet. 262: 102-1 14) and 48% identity 
with the Sca-2 protein of 5. carbophilus (Watanabe, L Et al, 1995, Gene 163: 81-85). 
Both of these enzymes belong to the cytochrome P450 family 105. The invariable 

10 cysteine found in the heme-binding domain of all cytochrome P450 enzymes is found 
at residue 356. This gene for epothilone B hydroxylase has been named ebk The 
ATG start codon of a putative ferredoxin gene of 64 amino acids is found nine 
basepairs downstream from the stop codon of ebh. This enzyme was found to share 
50% identity with ferredoxin genes of S. griseoulus (O'Keefe, D.P., et al, 1991, 

15 Biochemistry 30: 447-455) and S, noursei (Brautaset, T., et al, 2000, Chem. Biol. 7: 
395-403). The nucleic acid sequence encoding this ferredoxin is depicted in SEQ ID 
NO: 3 and the amino acid sequence for this ferredoxin polypeptide is depicted in SEQ 
ID N0:4. 

The ebh gene sequence was also used to isolate variant cytochrome P450 
20 genes from other microorganisms. Exemplary variant polynucleotides ebh4349\, 
ebhl4930, efcA53630, eM53550, eM39444, eb/i43333 and ebh35\65 of the present 
invention and the species from which they were isolated are depicted in Table 1 
below. The nucleic acid sequences for these variants are depicted in SEQ ID NO:36- 
42, respectively. 
25 Table 1: Variant polynucleotides 



ATCCID 


Species 


ebh gene designation 


43491 


AmvcolatoDsis orientalis 


ebMM91 


14930 


Amvcolatopsis orientalis 


ebhimO 


53630 


Amvcolatomis orientalis 


ebhS3630 


53550 


Amvcolatopsis orientalis 


eM53550 


39444 


Amvcolatopsis orientalis 


efr/r39444 


43333 


Amvcolatopsis orientalis 


ebM3333 


35165 


Amvcolatopsis orientalis 


«*/i35165 



The amino acid sequences encoded by the exemplary variants ebh4349l, 
ebhl4930, efc/i53630. ebh53550, eM39444, ebA43333 and efc/i35165 are depicted in 
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5 SEQ ID NO:43-49, respectively. Table 2 provides a summary of the amino acid 
substitutions of these exemplary variants. 



Table 2: Amino acid Substitutions 



Position 


ebh 


Substitution 


ebh variant 


100 


Gly 


Ser 


e&;il4930, ebh43333, ebh53550, efc/i43491 


101 


Lys 


Arg 


14930 


130 


Ue 


Leu 


eM14930 


192 


Ser 


Gin 


eM14930 


224 


Ser 


Thr 


ebhl4930, ebh43333, ebh53550, eftM3491 


285 


De 


Val 


ebhH930, ebh43333, ebh53550, ebh4349\ 


69 


Ser 


Asn 


ebh43333 


256 


Val 


Ala 


ebh43333, ebh53550, cZ?M3491 


93 
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efc/i53550, ebh4349\ 
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ebh4349\ 


398 
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efe;i39444 



Mutations were also introduced into the coding region of the ebh gene to 

10 identify mutants with improved yield, and/or rate of biocon version and/or altered 
substrate specificity. Exemplary mutant nucleic acid sequences of the present 
invention are depicted in SEQ ID NO:30, 32, 34, 60, 62, 64, 66, 68, 70, 72 and 74, 

The nucleic acid sequence of SEQ ID NO: 30 encodes a mutant ebh25-\ which 
exhibits altered substrate specificity. Plasmid pANT849efe/i25-l containing this 

15 mutant gene was deposited and accepted by an International Depository Authority 
under the provisions of the Budapest Treaty. The deposit was made on November 

, 2002 to the American Type Culture Collection at 10801 University Boulevard 

in Manassas, Virginia 201 10-2209. The ATCC Accession Number is 
. All restrictions upon public access to this plasmid will be 

20 irrevocably removed upon granting of this patent application. The Deposit will be 
maintained in a public depository for a period of thirty years after the date of deposit 
or five years after the last request for a sample or for the enforceable life of the patent, 
whichever is longer. The above-referenced plasmid was viable at the time of the 
deposit. The deposit will be replaced if viable samples cannot be dispensed by the 

25 depository. 

This S. lividans transformant identified in the screening of mutation 25 
(primers NPB29-mut25f (SEQ ID NO:58) and NPB29-mut25r (SEQ ID NO:59)) was 
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5 found to produce a product with a different HPLC elution time than epothilone B or 
epothilone F. A sample of this unknown was analyzed by LC-MS and was found to 
have a molecular weight of 523 (M.W.), consistent with a single hydroxylation of 
epothilone B. Plasmid DNA was isolated from the 5. lividans culture and used as a 
template for PGR amplification using primers NPB29-6f (SEQ ID NO:28) and 

10 NPB29-7r (SEQ ID NO:29) (see Example 17). The expected fragment was obtained 
and sequenced using the Big-Dye sequencing system. The ebh25-\ mutant was found 
to have two mutations resulting in changes in the amino acid sequence of the protein, 
asparagine 195 is changed to serine and serine 294 is changed to proline. The position 
targeted for mutation at codon 238 was found to have a two nucleotide change, which 

15 did not result in a change of the amino acid sequence of the protein. The amino acid 
sequence of the mutant polypeptide encoded by SEQ ID NO: 30 is depicted in SEQ ID 
N0:31. 

The nucleic acid sequence of SEQ ID NO:32 encodes a mutant efe/ilO-53, 
which exhibits improved bioconversion yield. This 5. lividans transformant identified 

20 in the screening of mutation 10 (primers NPB29-mutlOf (SEQ ID NO:54) and 

NPB29-mutlOr (SEQ ID NO:55)) produced a greater yield of epothilone F. Plasmid 
DNA was isolated from the S. lividans culture and used as a template for PGR 
amplification using primers NPB29-6f (SEQ ED NO:28) and NPB29-7r (SEQ ID 
NO:29)(see Example 16). The expected fragment was obtained and sequenced using 

25 the Big-Dye sequencing system. The efcA 10-53 mutant was found to have two 

mutations resulting in changes in the amino acid sequence of the protein, glutamic 
acid 231 is changed to arginine and phenylalanine 190 is changed to tyrosine. The 
position 231 was the target of the mutagenesis, the change at residue 190 is an 
inadvertent change that is an artifact of the mutagenesis procedure. The amino acid 

30 sequence of the mutant polypeptide encoded by SEQ ID NO:32 is depicted in SEQ ID 
NO:33. 

The nucleic acid sequence of SEQ ID NO:34 encodes a mutant ebh24-l6, 
which also exhibits improved bioconversion yield. This 5. lividans transformant, 
€bh2A'\6 identified in the screening of mutation 24 (primers NPB29-mut24f (SEQ ED 
35 NO:56) and NPB29-mut24r (SEQ ID NO:57) also produced a greater yield of 

epothilone F. Plasmid DNA was isolated from the 5. lividans culture and used as a 
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5 template for PGR amplification using primers NPB29-6f (SEQ ID NO:28) and 
NPB29-7r (SEQ ID NO:29). The expected fragment was obtained and sequenced 
using the Big-Dye sequencing system. The ebh24-\6 mutant was found to have two 
mutations resulting in changes in the amino acid sequence of the protein, 
phenylalanine 237 is changed to alanine and isoleucine 92 is changed to valine. The 
10 position 237 was the target of the mutagenesis* the change at residue 92 is an 

inadvertent change that is an artifact of the mutagenesis procedure. The amino acid 
sequence of the mutant polypeptide encoded by SEQ ID NO:34 is depicted in SEQ ID 
NO:35. 

The nucleic acid sequence of SEQ ID NO:60 encodes a mutant ebh24-l6dS, 

15 which also exhibits improved biocon version yield. This S. rimosus transformant, 

eM24-16d8 identified in the screening of mutation 59 (primer NPB29mut59 (SEQ ID 
NO:77)) also produced a greater yield of epothilone F. Plasmid DNA was isolated 
from the 5. rimosus culture and used as a template for PGR amplification using 
primers NPB29-6f (SEQ ID NO:28) and NPB29-7r (SEQ ID NO:29). The expected 

20 fragment was obtained and sequenced using the Big-Dye sequencing system. The 
ebh2A-\6d& mutant was found to have one mutation resulting in a change in the 
amino acid sequence of the protein, arginine 67 is changed to glutamine. This change 
is an artifact of the mutagenesis procedure. The amino acid sequence of the mutant 
polypeptide encoded by SEQ ID NO:60 is SEQ ID N0:61. 

25 The nucleic acid sequence of SEQ ID NO:62 encodes a mutant ebh2A-\6c\\, 

which also exhibits improved bioconversion yield. This 5. rimosus transformant, 
eZ?/i24-16cll identified in the screening of mutation 59 (primer NPB29mut59 (SEQ 
ED NO:77)) also produced a greater yield of epothilone F. Plasmid DNA was isolated 
from the 5. rimosus culture and used as a template for PGR amplification using 

30 primers NPB29-6f (SEQ ID NO:28) and NPB29-7r (SEQ ID NO:29). The expected 
fragment was obtained and sequenced using the Big-Dye sequencing system. The 
^M24-16cl 1 mutant was found to have two additional mutations resulting in changes 
in the amino acid sequence of the protein, alanine 93 is changed to glycine and 
isoleucine 365 is changed to threonine. The position 93 is the target of the 

35 mutagenesis, the change at 365 is an artifact of the mutagenesis procedure. The 
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5 amino acid sequence of the mutant polypeptide encoded by SEQ ID NO:62 is 
depicted in SEQ ID NO:63. 

The nucleic acid sequence of SEQ ID NO: 64 encodes a mutant eib/z24-16-16, 
which also exhibits improved bioconversion yield. This 5. rimosus transformant, 
e/7/i24-16-16 identified in the screening of random mutants of ebh24-\6 also 

10 produced a greater yield of epothilone P. Plasmid DNA was isolated from the 5. 

rimosus culture and used as a template for PGR amplification using primers NPB29- 
6f (SEQ ID NO:28) and NPB29-7r (SEQ ID NO:29). The expected fragment was 
obtained and sequenced using the Big-Dye sequencing system. The eZ?/i24-16-16 
mutant was found to have one additional mutation resulting in changes in the amino 

15 acid sequence of the protein, valine 106 is changed to alanine. The amino acid 

sequence of the mutant polypeptide encoded by SEQ ID NO:64 is depicted in SEQ ED 
NO:65. 

The nucleic acid sequence of SEQ ID NO:66 encodes a mutant efc/i24- 16-74, 
which also exhibits improved bioconversion yield. This S. rimosus transformant, 

20 eM24- 16-74 identified in the screening of random mutants of ebhlA-\6 also 
produced a greater yield of epothilone F. Plasmid DNA was isolated from the S. 
rimosus culture and used as a template for PGR amplification using primers NPB29- 
6f (SEQ ID NO:28) and NPB29-7r (SEQ ID NO:29). The expected fragment was 
obtained and sequenced using the Big-Dye sequencing system. The ebhlA-ld-lA 

25 mutant was found to have one additional mutation resulting in changes in the amino 
acid sequence of the protein, arginine 88 is changed to histidine. The amino acid 
sequence of the mutant polypeptide encoded by SEQ ID NO:66 is SEQ ID NO:67. 

The nucleic acid sequence of SEQ ID NO:68 encodes a mutant ei)/i24-M18, 
which also exhibits improved bioconversion yield. This S. rimosus transformant, 

30 ebhM-\?> identified in the screening of random mutants of ebh also produced a 

greater yield of epothilone F. Plasmid DNA was isolated from the S. rimosus culture 
and used as a template for PGR amplification using primers NPB29-6f (SEQ ID 
NO:28) and NPB29-7r (SEQ ID NO:29), The expected fragment was obtained and 
sequenced using the Big-Dye sequencing system. The ebhlA-\% mutant was found to 

35 have two mutations resulting in changes in the amino acid sequence of the protein, 
glutamic acid 31 is changed to lysine and methionine 176 is changed to valine. The 
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5 amino acid sequence of the mutant polypeptide encoded by SEQ ID NO:68 is 
depicted in SEQ ID NO:69. 

The nucleic acid sequence of SEQ ID NO:72 encodes a mutant €bh24-l6g%, 
which also exhibits improved bioconversion yield. This S. rimosus transformant, 
ebh24-l6gS identified in the screening of mutation 50 (primer NPB29mut50 (SEQ ID 

10 NO:78)) also produced a greater yield of epothilone F. Plasmid DNA was isolated 
from the S. rimosus culture and used as a template for PGR amplification using 
primers NPB29-6f (SEQ ED NO:28) and NPB29-7r (SEQ ID NO:29). The expected 
fragment was obtained and sequenced using the Big-Dye sequencing system. The 
eM24-16g8 mutant was found to have two additional mutations resulting in changes 

15 in the amino acid sequence of the protein, methionine 176 is changed to alanine and 
isoleucine 130 is changed to threonine. The position 176 is the target of the 
mutagenesis, the change at 130 is an artifact of the mutagenesis procedure. The 
amino acid sequence of the mutant polypeptide encoded by SEQ ID NO:72 is 
depicted in SEQ ID NO:73. 

20 The nucleic acid sequence of SEQ ID NO:74 encodes a mutant £i?/z24-16b9, 

which also exhibits improved bioconversion yield. This S, rimosus transformant, 
ei/i24-16b9 identified in the screening of mutation 50 (primer NPB29mut50 (SEQ ID 
NO:78)) also produced a greater yield of epothilone F. Plasmid DNA was isolated 
from the 5. rimosus culture and used as a template for PGR amplification using 

25 primers NPB29-6f (SEQ ID NO:28) and NPB29-7r (SEQ ID NO:29). The expected 
fragment was obtained and sequenced using the Big-Dye sequencing system. The 
^M24-16b9 mutant was found to have two additional mutations resulting in changes 
in the amino acid sequence of the protein, methionine 176 is changed to serine and 
alanine 140 is changed to threonine. The position 176 is the target of the mutagenesis, 

30 the change at 140 is an artifact of the mutagenesis procedure. The amino acid 

sequence of the mutant polypeptide encoded by SEQ ID NO:74 is depicted in SEQ ID 
NO:75. 

A mixture composed of the plasmids pANT849eM-24-16, pANT849eM-10- 
53, pANT849efc/i-24-16d8, pANT849eZ?A-24-16cll, pANT849efcA-24-16-16, 
35 pant849ef?/i-24-16-74. pANT849efr/i-24-16b9, pANT849eZ?/i-M18 and pANTmebh- 
24-16g8 for these nine mutant genes was deposited and accepted by an International 
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5 Depository Authority under the provisions of the Budapest Treaty. The deposit was 

made on November , 2002 to the American Type Culture Collection at 10801 

University Boulevard in Manassas, Virginia 201 10-2209. The ATCC Accession 

Number is . All restrictions upon public access to this mixture of 

plasmids will be irrevocably removed upon granting of this patent application. The 

10 deposit will be maintained in a public depository for a period of thirty years after the 
date of deposit or five years after the last request for a sample or for the enforceable 
life of the patent, whichever is longer. The above-referenced mixture of plasmids was 
viable at the time of the deposit. The deposit will be replaced if viable samples 
cannot be dispensed by the depository. 

15 Thus, in accordance with another aspect of the present invention, there are 

provided isolated polypeptides of epothilone B hydroxylase and variants and mutants 
thereof and isolated polypeptides of ferredoxin or variants thereof. In one 
embodiment of the present invention, by "polypeptide" it is meant to include the 
amino acid sequence of SEQ ID NO: 2, and fragments or variants, which retain 

20 essentially the same biological activity and/or function as this epothilone B 

hydroxylase. In another embodiment of the present invention, by "polypeptide** it is 
meant to include the amino acid sequence of SEQ ID N0:4, and fragments and/or 
variants, which retain essentially the same biological activity and/or function as this 
ferredoxin. 

25 By "variants" as used herein it is meant to include polypeptides with amino 

acid sequences with conservative amino acid substitutions as compared to SEQ ID 
NO: 2 or 4 which are demonstrated to exhibit similar biological activity and/or 
function to SEQ ID NO:2 or 4. By "conservative amino acid substitutions" it is 
meant to include replacement, one for another, of the aliphatic amino acids such as 

30 Ala, Val, Leu and lie, the hydroxyl residues Ser and Thr. the acidic residues Asp and 
Glu, and the amide residues Asn and Gin. Exemplary variant amino acid sequences 
of the present invention are depicted in SEQ ID NO:43-49 and the amino acid 
substitutions of these exemplary variants are described in Table 2, supra. 

By "mutants" as used herein it is meant to include polypeptides encoded by 

35 nucleic acid sequences with one or more point mutations, or deletions or additions of 
nucleic acids as compared to SEQ ID NO: 1 or 3, but which still have similar activity 
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5 to the polypeptides encoded by SEQ ID NO: 1 or 3. In a preferred embodiment, 
mutations are made to the nucleic acid that alter the substrate specificity and/or yield 
from the polypeptide encoded thereby. A preferred region of mutation with respect to 
the epothilone B hydroxylase gene is that region of the nucleic acid sequence coding 
for the approximately 1 13 amino acid residues comprising the active site of the 

0 enzyme. Also preferred are mutants with at least one amino acid substitution at 
amino acid position GLU31, ARG67, ARG88, ILE92, ALA93, VAL106, ILE130, 
ALAMO, MET176, PHE190, GLU 231, SER294, PHE237, or ILE365 of SEQ ID 
N0:1 Exemplary mutants ebh25-\,ebh\0'53, e&/z24-16, ebh24A6dS, €fcW4-16clI, 
ebh24A6'\6, ei/i24-16-74, efc/i24-16g8, efr/i24-16b9 and the nucleic acid sequences 

5 encoding such mutants of the present invention are depicted in SEQ ID NO:31, 33, 
35, 61, 63, 65, 67, 69, 71, 73 and 75, and SEQ ID NO:30, 32, 34, 60, 62, 64, 66, 68, 
70, 72 and 74, respectively. 

A 3-dimensional model of epothilone B hydroxylase has also been constructed 
in accordance with general teachings of Greer et al. (Comparative modeling of 

0 homologous proteins. Methods In Enzymology 202239-52, 1991), Lesk et al. 
(Homology Modeling: Inferences from Tables of Aligned Sequences. Curr. Op. 
Struc. Biol. (2) 242-247, 1992), and Cardozo et al. (Homology modeling by the ICM 
method. Proteins 23, 403-14, 1995) on the basis of the known structure of a 
homologous protein EryF (PDB Code IKIN chain A). Homology between these 

5 sequences is 34%. Alignment of the sequences of epothilone B hydroxylase (SEQ ID 
N0:2) and EryF (PDB Code IKIN chain A; SEQ ID NO:76) is depicted in Figure 3. 
A homology model of epothilone B hydroxylase based upon sequence alignment with 
EryF is depicted in Figure 4. 

An energy plot of the epothilone B hydroxylase model relative to EryF (PDB 

} code 1 JIN) was also prepared and is depicted in Figure 5. An averaging window size 
of 51 residues was used at a given residue position to calculate the average of the 
energies of the 51 residues in the sequence that lie with the given residue at the central 
position. As shown in Figure 5, all energies along the sequence lie below zero thus 
indicating that the modeled structure as set forth in Figure 4 and Appendix 1 is 

5 reasonable. 
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5 The three-dimensional structure represented in the homology model of 

epothilone B hydroxylase of Figure 4 is defined by a set of structure coordinates as set 
forth in Appendix 1. The term "structure coordinates" refers to Cartesian coordinates 
generated from the building of a homology model. As will be understood by those of 
skill in the art, however, a set of structure coordinates for a protein is a relative set of 

10 points that define a shape in three dimensions. Thus, it is possible that an entirely 
different set of coordinates could define a similar or identical shape. Moreover, slight 
variations in the individual coordinates, as emanate from generation of similar 
homology models using different alignment templates and/or using different methods 
in generating the homology model, will have minor effects on the overall shape. 

15 Variations in coordinates may also be generated because of mathematical 

manipulations of the structure coordinates. For example, the structure coordinates set 
forth in Appendix 1 could be manipulated by fractionalization of the structure 
coordinates; integer additions or subtractions to sets of the structure coordinates, 
inversion of the structure coordinates or any combination of the above. 

20 Various computational analyses are therefore necessary to determine whether 

a molecule or a portion thereof is sufficiently similar to all or parts of epothilone B 
hydroxylase described above as to be considered the same. Such analyses may be 
carried out in current software applications, such as S YB YL version 6.7 or 
BSrSIGHTII (Molecular Simulations Inc., San Diego, CA) version 2000 and as 

25 described in the accompanying User's Guides. 

For example, the superimposition tool in the program SYB YL allows 
comparisons to be made between different structures and different conformations of 
the same structure. The procedure used in S YB YL to compare structures is divided 
into four steps: 1) load the structures to be compared; 2) define the atom equivalencies 

30 in these structures; 3) perform a fitting operation; and 4) analyze the results. Each 
structure is identified by a name. One structure is identified as the target (i.e., the 
fixed structure); the second structure (i.e., moving structure) is identified as the source 
structure. Since atom equivalency within S YB YL is defined by user input, for the 
purpose of this aspect of the present invention equivalent atoms are defined as protein 

35 backbone atoms (N, Ca, C and O) for all conserved residues between the two 
structures being compared. Further, only rigid fitting operations are considered . 
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5 When a rigid fitting method is used, the working structure is translated and rotated to 
obtain an optimum fit with the target structure. The fitting operation uses an algorithm 
that computes the optimum translation and rotation to be applied to the moving 
structure, such that the root mean square difference of the fit over the specified pairs 
of equivalent atoms is an absolute minimum. This number, given in angstroms, is 

10 reported by SYBYL. 

For the purposes of the present invention, any homology model of epothilone 
B hydroxylase that has a root mean square deviation of conserved residue backbone 
atoms (N, Ca, C, O) of less than about 4.0 A when superimposed on the 
corresponding backbone atoms described by structure coordinates listed in Appendix 

15 1 are considered identical. More preferably, the root mean square deviation is less 
than about 3.0 A. More preferably the root mean square deviation is less than about 
2.0 A. 

For the purpose of this invention, any homology model of epothilone B 
hydroxylase that has a root mean square deviation of conserved residue backbone 
20 atoms (N, Co, C, O) of less than about 2.0 A when superimposed on the 

corresponding backbone atoms described by structure coordinates listed in Appendix 
1 are considered identical. More preferably, the root mean square deviation is less 
than about 1.0 A. 

In another embodiment of the present invention, structural models wherein 
25 backbone atoms have been substituted with other elements which when superimposed 
on the corresponding backbone atoms have low root mean square deviations are 
considered to be identical. For example, an homology model where the original 
backbone carbon, and/or nitrogen and/or oxygen atoms are replaced with other 
elements having a root mean square deviation of about 4.0 A, more preferably about 
30 3.0 A, even more preferably less than about 2A, when superimposed on the 

corresponding backbone atoms described by structure coordinates listed in Appendix 
1 is considered identical. 

The term "root mean square deviation" means the square root of the arithmetic 
mean of the squares of the deviations from the mean. It is a way to express the 
35 deviation or variation from a trend or object. For purposes of this invention, the "root 
mean square deviation" defines the variation in the backbone of a protein from the 



-20- 



wo 03/057830 



PCT/US02/40359 



5 relevant portion of the backbone of the epothilone B hydroxylase portion of the 
complex as defined by the structure coordinates described herein. 

The present invention as embodied by the homology model enables the 
structure-based design of additional mutants of epothilone B hydroxylase. For 
example, using the homology model of the present invention, residues lying within 

10 lOA of the binding site of epothilone B hydroxylase have now been defined. These 
residues include LEU39, GLN43, ALA45, MET57, LEU58, HIS62. PHE63, SER64, 
SER65, ASP66, ARG67, GLN68, SER69. LEU74, MET75, VAL76, ALA77, 
ARG78, GLN79, ILE80, ASP84, LYS85, PR086, PHE87. ARG88, PR089, SER90, 
LEU91, ILE92, ALA93, MET94, ASP95, HIS99, ARG103, PHEllO, ILE155, 

15 PHE169, GLNHO, CYS172, SER173, SER174, ARG175, MET176, LEU177, 
SER178, ARG179, ARG186, PHE190. LEU193, VAL233, GLY234, LEU235. 
ALA236, PHE237, LEU238, LEU239. LEU240, ILE241, ALA242, GLY243, 
HIS244, GLU245, THR246, THR247, ALA248, ASN249, MET250, LEU283, 
THR287, ILE288, ALA289, GLU290, THR291, ALA292, THR293, SER294, 

20 ARG295, PHE296, ALA297, THR298, GLU312, GLY313, VAL314, VAL315, 
GLY316, VAL344, ALA345, PHE346, GLY347, PHE348, VAL350, HIS351, 
GLN352, CYS353, LEU354, GLY355, GLN356, LEU358, ALA359, GLU362, 
LYS389, ASP391, SER392,THR393, ILE394 and TYR395 as set forth in Appendix 
1. Mutants with mutations at one or more of these positions are expected to exhibit 

25 altered biological function and/or specificity and thus comprise another embodiment 
of preferred mutants of the present invention. Another embodiment of preferred 
mutants are molecules that have a root mean square deviation from the backbone 
atoms of said epothilone B hydroxylase of not more than about 4.0A. 

The structure coordinates of an epothilone B hydroxylase homology model or 

30 portions thereof are stored in a machine-readable storage medium. Such data may be 
used for a variety of purposes, such as drug discovery. 

Accordingly, another aspect of the present invention relates to machine- 
readable data storage medium comprising a data storage material encoded with the 
structure coordinates set forth in Appendix 1 . 

35 The three-dimensional model structure of epothilone B hydroxylase can also 

be used to identify modulators of biological function and potential substrates of the 
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5 enzyme. Various methods or combinations thereof can be used to identify such 
modulators. 

For example, a test compound can be modeled that fits spatially into a binding 
site in epothilone B hydroxylase, according to Appendix 1. Structure coordinates of 
amino acids within 10 A of the binding region of epothilone B hydroxylase defined by 

10 amino acids LEU39, GLN43, ALA45, MET57, LEU58, H1S62, PHE63. SER64, 
SER65, ASP66, ARG67, GLN68, SER69, LEU74, MET75, VAL76, ALA77, 
ARG78, GLN79, ILE80, ASP84, LYS85, PR086, PHE87, ARG88, PR089, SER90, 
LEU91, ILE92, ALA93, MET94, ASP95, HIS99, ARG103, PHEl 10, ILE155, 
PHE169, GLN170, CYS172, SER173, SER174, ARG175, MET176, LEU177, 

15 SER178, ARG179. ARG186, PHE190, LEU193, VAL233, GLY234, LEU235. 
ALA236, PHE237, LEU238, LEU239, LEU240, ILE241, ALA242, GLY243, 
HIS244, GLU245, THR246, THR247, ALA248, ASN249. MET250, LEU283, 
THR287, rLE288. ALA289, GLU290, THR291, ALA292, THR293, SER294, 
ARG295, PHE296, ALA297, THR298, GLU312, GLY313, VAL314. VAL315, 

20 GLY3 16, VAL344, ALA345, PHE346, GLY347, PHE348, VAL350, HIS35 1 , 
GLN352, CYS353, LEU354, GLY355, GLN356. LEU358, ALA359, GLU362, 
LYS389, ASP391, SER392,THR393, ILE394 and TYR395, and the coordinated 
heme group, HEMl can also be used to identify desirable structural and chemical 
features of such modulators. Identified structural or chemical features can then be 

25 employed to design or select compounds as potential epothilone B hydroxylase 

ligands. By structural and chemical features it is meant to include, but is not limited 
to, covalent bonding, van der Waals interactions, hydrogen bonding interactions, 
charge interaction, hydrophobic bonding interaction, and dipole interaction. 
Compounds identified as potential epothilone B hydroxylase ligands can then be 

30 synthesized and screened in an assay characterized by binding of a test compound to 
epothilone B hydroxylase, or in characterizing the ability of epothilone B hydroxylase 
to modulate a protease target in the presence of a small molecule. Examples of assays 
useful in screening of potential epothilone B hydroxylase ligands include, but are not 
linnited to, screening in silico, in vitro assays and high throughput assays. 

35 As will be understood by those of skill in the art upon this disclosure, other 

structure-based design methods can be used. Various computational structure-based 
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5 design methods have been disclosed in the art. For example, a number of computer 
modeling systems are available in which the sequence of epothilone B hydroxylase 
and the epothilone B hydroxylase structure (i.e., atomic coordinates of epothilone B 
hydroxylase as provided in Appendix 1 and/or the atomic coordinates within lOA of 
the binding region as provided above) can be input. This computer system then 

10 generates the structural details of one or more these regions in which a potential 

epothilone B hydroxylase modulator binds so that complementary structural details of 
the potential modulators can be determined. Design in these modeling systems is 
generally based upon the compound being capable of physically and structurally 
associating with epothilone B hydroxylase. In addition, the compound must be able 

15 to assume a conformation that allows it to associate with epothilone B hydroxylase. 
Some modeling systems estimate the potential inhibitory or binding effect of a 
potential epothilone B hydroxylase substrate or modulator prior to actual synthesis 
and testing. 

Methods for screening chemical entities or fragments for their ability to 

20 associate with a given protein target are also well known. Often these methods begin 
by visual inspection of the binding site on the computer screen. Selected fragments or 
chemical entities are then positioned in a binding region of epothilone B hydroxylase. 
Docking is accomplished using software such as INSIGHTII. QUANTA and S YBYL, 
following by energy minimization and molecular dynamics with standard molecular 

25 mechanic force fields such as, MMFF. CHARMM and AMBER. Examples of 
computer programs which assist in the selection of chemical fragment or chemical 
entities useful in the present invention include, but are not limited to, GRID 
(Goodford. 1985), AUTODOCK (Goodsell, 1990). and DOCK (Kuntz et al. 1982). 
Upon selection of preferred chemical entities or fragments, their relationship 

30 to each other and epothilone B hydroxylase can be visualized and then assembled into 
a single potential modulator. Programs useful in assembling the individual chemical 
entities include, but are not limited to CAVEAT (Bartlett et al. 1989) and 3D 
Database systems (Martin 1992). 

Alternatively, compounds may be designed de novo using either an empty 

35 active site or optionally including some portion of a known inhibitor. Methods of this 
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5 type of design include, but are not limited to LUDI (Bohm 1992) and LeapFrog 
(Tripos Inc., St. Louis MO). 

Programs such as DOCK (Kuntz et al. 1982) can be used with the atomic 
coordinates from the homology model to identify potential ligands from databases or 
virtual databases which potentially bind the in the active site binding region which 

10 may therefore be suitable candidates for synthesis and testing. 

Also provided in the present invention are vectors comprising polynucleotides 
of the present invention and host cells which are genetically engineered with vectors 
of the present invention to produce epothilone B hydroxylase or active fragments and 
variants or mutants of this enzyme and/or ferredoxin or active fragments thereof. 

15 Generally, any vector suitable to maintain, propagate or express polynucleotides to 
produce these polypeptides in the host cell may be used for expression in this regard. 
In accordance with this aspect of the invention the vector may be, for example, a 
plasmid vector, a single- or double-stranded phage vector, or a single- or double- 
stranded RNA or DNA viral vector. Vectors may be extra-chromosomal or designed 

20 for integration into the host chromosome. Such vectors include, but are not limited to, 
chromosomal, episomal and vims-derived vectors e.g., vectors derived from bacterial 
plasmids, bacteriophages, yeast episomes, yeast chromosomal elements, and viruses 
such as baculoviruses, papova viruses, S V40, vaccinia viruses, adenoviruses, fowl 
pox viruses, pseudorabies viruses and retroviruses, and vectors derived from 

25 combinations thereof, such as those derived from plasmid and bacteriophage genetic 
elements, cosmids and phagemids. 

Useful expression vectors for prokaryotic hosts include, but are not limited to, 
bacterial plasmids, such as those from E. coli. Bacillus or Streptomyces, including 
pBluescript, pGEX-2T, pUC vectors, pET vectors, ColEl, pCRl, pBR322, pMB9, 

30 pCW, pBMS200, pBMS2020, PUlOl, PU702, pANT849, pOJ260, pOJ446, 
pSET152, pKCl 139, pKC1218, pFD666 and their derivatives, wider host range 
plasmids, such as RP4, phage DNAs, e,g., the numerous derivatives of phage lambda, 
e.g., NM989, XGTIO and XGTl 1, and other phages, ^.g., M13 and filamentous single 
stranded phage DNA. 

35 Vectors of the present invention for use in yeast will typically contain an 

origin of replication suitable for use in yeast and a selectable marker that is functional 
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5 in yeast. Examples of yeast vectors useful in the present invention include, but are not 
limited to. Yeast Integrating plasmids {e.g., YIp5) and Yeast Replicating plasmids 
(the YRp and YEp series plasmids), Yeast Centromere plasmids (the YCp series 
plasmids), Yeast Artificial Chromosomes (YACs) which are based on yeast linear 
plasmids, denoted YLp, pGPD-2, 2\i plasmids and derivatives thereof, and improved 

10 shuttle vectors such as those described in Gietz et al. Gene, 14: 527-34 (1988) 
(YIplac, YEplac and YCplac). 

Mammalian vectors useful for recombinant expression may include a viral 
origin, such as the S V40 origin (for replication in cell lines expressing the large 
T-antigen, such as COSl and C0S7 cells), the papillomavirus origin, or the EBV 

15 origin for long term episomal replication (for use, e.g., in 293-EBNA cells, which 
constitutively express the EBV EBNA-1 gene product and adenovirus El A). 
Expression in mammalian cells can be achieved using a variety of plasmids, 
including, but not limited to, pSV2, pBC12BI, and p91023, pCDNA vectors as well as 
lytic virus vectors (e.g., vaccinia virus, adeno virus, and baculovirus), episomal virus 

20 vectors (e.g., bovine papillomavirus), and retroviral vectors (e.g., murine 

retroviruses). Useful vectors for insect cells include baculoviral vectors and pVL941. 

Selection of an appropriate promoter to direct mRNA transcription and 
construction of expression vectors are well known. In general, however, expression 
constructs will contain sites for transcription initiation and termination, and, in the 

25 transcribed region, a ribosome binding site for translation. The coding portion of the 
mature transcripts expressed by the constructs will include a translation initiating 
codon at the beginning and a termination codon appropriately positioned at the end of 
the polypeptide to be translated. 

Examples of useful promoters for prokaryotes include, but are not lunited to 

30 phage promoters such as phage lambda pL promoter, the trc promoter, a hybrid 
derived from the trp and lac promoters, the bacteriophage T7 promoter, the TAC or 
TRC system, the major operator and promoter regions of phage lambda, the control 
regions of fd coat protein, snpA promoter, melC promotor, ermE* promoter or the 
aroBAiD operon. Examples of useful promoters for yeast include, but are not limited 

35 to, the CYCl promoter, the GALl promoter, the GALIO promoter, ADHl promoter, 
the promoters of the yeast a-mating system, and the GPD promoter. Examples of 
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5 promoters routinely used in mammalian expression vectors include, but are not 

limited to, the CMV immediate early promoter, the HS V thyniidine kinase promoter, 
the early and late SV40 promoters, the promoters of retroviral LTRs, such as those of 
the Rous Sarcoma Virus(RSV), and metallothionein promoters, such as the mouse 
metallothionein-I promoter. 

10 Vectors comprising the polynucleotides can be introduced into host cells using 

any number of well known techniques including infection, transduction, transfection, 
transvection and transformation. The polynucleotides may be introduced into a host 
alone or with additional polynucleotides encoding, for example, a selectable marker 
or ferredoxin reductase. In a preferred embodiment of the present invention the 

15 polynucleotide for epothilone B hydroxylase and ferredoxin are introduced into the 
host cell. Host cells for the various expression constructs are well known, and those 
of skill can routinely select a host cell for expressing the epothilone B hydroxylase 
and/or ferredoxin in accordance with this aspect of the present invention. Examples 
of mammalian expression systems useful in the present invention include, but are not 

20 limited to, the CI 27, 3T3, CHO, HeLa, human kidney 293 and BHK cell lines, and 
the COS-7 line of monkey kidney fibroblasts. 

Alternatively, as exemplified herein, epothilone B hydroxylase and ferredoxin 
can be expressed recombinantly in microorganisms. 

Accordingly, another aspect of the present invention relates to recombinantly 

25 produced niicroorganisms which express epothilone B hydroxylase alone or in 

conjunction with the ferredoxin and which are capable of hydroxylating a compound , 
and in particular an epothilone, having a terminal alkyl group to produce ones having 
a terminal hydroxyalkyl group. The recombinantly produced microorganisms are 
produced by transforming cells such as bacterial cells with a plasnfiid comprising a 

30 nucleic acid sequence encoding epothilone B hydroxylase. In a preferred 

embodiment, the cells are transformed with a plasmid comprising a nucleic acid 
encoding epothilone B hydroxylase or mutants or variants thereof as well as the 
nucleic acid sequence encoding ferredoxin located downstream of the epothilone B 
hydroxylase gene. Examples of microorganisms which can be transformed with these 

35 plasmids to produce the recombinant microorganisms of the present invention 

include, but are not limited, Escherichia coli, Bacillus megaterium, Amycolatopsis 
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5 orientalis, Sorangium cellulosum, Rhodococcus erythropolis, and Streptomyces 
species such as Streptomyces lividans, Streptomyces virginiae, Streptomyces 
venezuelae, Streptomyces albus, Streptomyces coelicolor, Streptomyces rimosus and 
Streptomyces griseus. 

The recombinantly produced microorganisms of the present invention are 

10 useful in microbial processes or methods for production of compounds, and in 
particular epothilones, containing a terminal hydroxyalkyi group. In general, the 
hydroxyalkyl-bearing product can be produced by culturing the recombinantly 
produced microorganism or enzyme derived therefrom, capable of selectively 
hydroxylating a terminal carbon or alkyl, in the presence of a suitable substrate in an 

15 aqueous nutrient medium containing sources of assimilable carbon and nitrogen, 
under submerged aerobic conditions. 

Suitable epothilones employed as substrate for the method of the present 
invention may be any such compound having a terminal carbon or terminal alkyl 
group capable of undergoing the enzymatic hydroxylation of the present invention. 

20 The starting material, or substrate, can be isolated from natural sources, such as 
Sorangium cellulosum, or they can be synthetically formed epothilones. Other 
substrates having a terminal carbon or terminal alkyl group capable of undergoing an 
enzymatic hydroxylation can be employed by the methods herein. For example, 
compactin can be used as a substrate, which upon hydroxylation forms the compound 

25 pravastatin. Methods for hydroxylating compactin to pravastatin via an 

Actinomadura strain are set forth in U.S. Patent 5.942,423 and U.S. Patent 6.274,360. 

For example, using the recombinant microorganisms of the present invention 
at least one epothilone can be prepared as described in WO 00/39276, U. S. Serial. 
No. 09/468,854, filed December 21, 1999, the text of which is incorporated herein as 

30 if set forth at length. An epothilone of the following Formula I 

HO-CH2-(A,)n-(Q)m-(A2)o-E (I) 

where 

Ai and A2 are independently selected from the group of optionally substituted 
CpCa alkyl and alkenyl; 
35 Q is an optionally substituted ring system containing one to three rings and at 

least one carbon to carbon double bond in at least one ring; 
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5 n, m, and o are integers selected from the group consisting of zero and 1 , 

where at least one of m orn or o is 1; and 

E is an epothilone core; can be prepared. 
This method comprises the steps of contacting at least one epothilone of the following 
formula n 

10 CH3-(AOn-(QW(A2)o-E (H) 

where Ai, Q, A2, E, n, m, and 0 are defined as above; 
with a recombinantly produced microorganism, or an enzyme derived 
therefrom, which is capable of selectively catalyzing the hydroxylation of formula H, 
and effecting said hydroxylation. 

15 In a preferred embodiment, the starting material is epothilone B. Epothilone B 

can be obtained from the fermentation of Sorangium cellulosum So ce90, as described 
in DE 41 38 042 and WO 93/10121. The strain has been deposited at the Deutsche 
Sanmilung von Mikroorganismen (German Collection of Microorganisms) (DSM) 
under No. 6773. The process of fermentation is also described in Hofle, G., et al., 

20 Angew. Chem, Int. Ed. Engl., Vol 35, No. 13/14. 1567-1569 (1996). Epothilone B can 
also be obtained by chemical means, such as those disclosed by Meng, D., et al., J. 
Am. Chem. Soc, Vol. 119, No. 42, 10073-10092 (1996); Nicolaou. K., et al., J. Am. 
Chem. Soc, Vol. 119, No. 34, 7974-7991 (1997) and Schinzer, D., et al., Chem. Eur. 
/, Vol. 5, No. 9, 2483-2491 (1999). 

25 Growth of the recombinantly produced microorganism selected for use in the 

process may be achieved by one of ordinary skill in the art by the use of appropriate 
nutrient medium. Appropriate media for the growing of the recombinantly produced 
microorganisms include those that provide nutrients necessary for the growth of 
microbial cells. See, for example, T. Nagodawithana and J. M. Wasileski, Chapter 2: 

30 "Media Etesign for Industrial Fermentations," Nutritional Requirements of 

Commercially Important Microorganism , edited by T. W. Nagodawithana and G. 
Reed, Esteekay Associates, Inc., Milwaukee, WI, 18-45 (1998); T. L, Miller and B. 
W. Churchill, Chapter 10: "Substrates for Large-Scale Fermentations," Manual of 
Industrial Microbiology and Biotechnology , edited by A.L. Demain and N. A. 

35 Solomon, American Society for Microbiology, Washington, D.C., 122-136 (1986). A 
typical medium fof growth includes necessary carbon sources, nitrogen sources, and 
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5 trace elements. Inducers may also be added to the medium. The term inducer as used 
herein, includes any compound enhancing formation of the desired enzymatic activity 
within the recombinantly produced microbial cell. Typical inducers as used herein 
may include solvents used to dissolve substrates, such as dimethyl sulfoxide, dimethyl 
formamide, dioxane, ethanol and acetone. Further, some substrates, such as 

10 epothilone B, may also be considered to be inducers. 

Carbon sources may include sugars such as glucose, fructose, galactose, 
maltose, sucrose, mannitol, sorbital, glycerol starch and the like; organic acids such as 
sodium acetate, sodium citrate, and the like; and alcohols such as ethanol, propanol 
and the like. Preferred carbon sources include, but are not limited to, glucose, 

15 fructose, sucrose, glycerol and starch. 

Nitrogen sources may include an N-Z amine A, corn steeped liquor, soybean 
meal, beef extract, yeast extract, tryptone, peptone, cottonseed meal, peanut meal, 
amino acids such as sodium glutamate and the like, sodium nitrate, ammonium sulfate 
and the like. 

20 Trace elements may include magnesium, manganese, calcium, cobalt, nickel, 

iron, sodium and potassium salts. Phosphates may also be added in trace or 
preferably, greater than trace amounts. 

The medium employed for the fermentation may include more than one 
carbon or nitrogen source or other nutrient. 

25 For growth of the recombinantly produced microorganisms and/or 

hydroxylation according to the method of the present invention, the pH of the medium 
is preferably from about 5 to about 8 and the temperature is from about 14°C to about 
37°C, preferably the temperature is 28°C. The duration of the reaction is 1 to 100 
hours, preferably 8 to 72 hours. 

30 The medium is incubated for a period of time necessary to complete the 

biotransformation as monitored by high performance liquid chromatography (HPLC). 
Typically, the period of time needed to complete the transformation is twelve to one 
hundred hours and preferably about 72 hours after the addition of the substrate. The 
medium is placed on a rotary shaker (New Brunswick Scientific Innova 5000) 

35 operating at 150 to 300 rpm and preferably about 250 rpm with a throw of 2 inches. 
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5 The hydroxyalkyl-bearing product can be recovered from the fermentation 

broth by conventional means that are commonly used for the recovery of other known 
biologically active substances. Examples of such recovery means include, but are not 
limited to, isolation and purification by extraction with a conventional solvent, such as 
ethyl acetate and the like; by pH adjustment; by treatment with a conventional resin, 

10 for example, by treatment with an anion or cation exchange resin or a non-ionic 
adsorption resin; by treatment with a conventional adsorbent, for example, by 
distillation, by crystallization; or by recrystallization, and the like. 

The extract obtained above from the biotransformation reaction mixture can be 
further isolated and purified by column chromatography and analytical thin layer 

15 chromatography. 

The ability of a recombinantly produced microorganism of the present 
invention to biotransform an epothilone having a terminal alkyl group to an 
epothilone having a terminal hydroxyalkyl group was demonstrated. In these 
experiments, a culture comprising a Streptomyces lividans clone containing a plasmid 

20 with the ebh gene as described in more detail in Example 1 1 was incubated with an 
epothilone B suspension for 3 days at 30°with agitation. A sample of the incubate 
was extracted with an equal volume of 25% methanol: 75% n-butanol, vortexed and 
allowed to settle for 5 minutes. Two hundred |il of the organic phase was transferred 
to an HPLC vial and analyzed by HPLC/MS (Example 12). A product peak of 

25 epothilone F eluted at a retention time of 15.9 minutes and had a protonated molecular 
weight of 524. The epothilone B substrate eluted at 19.0 minutes and had a 
protonated molecular weight of 508. The peak retention times and molecular weights 
were confirmed using known standards. 

Rates of biotransformation of epothilone B by cells expressing ebh were also 

30 compared to rates of biotransformation by ebh mutants. Cells expressing ebh 
comprised a frozen spore preparation of. 5. lividans (pANT849-efe/i). Cells 
expressing mutants comprises frozen spore preparations of S. lividans (pANT849- 
e/?/ilO-53) and5. KvK/flTW (pANT849-^Z>/i24-16). A frozen spore preparation of 5. 
lividans TK24 was used as the control. The cells were pre-incubated for several days 

35 at SO^'C. Following this pre-incubation, epothilone B in 100% EtOH was added to 
each culture to a final concentration of 0.05% weight/volume. Samples were then 
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5 taken at 0, 24, 48 and 72 hours with the exception of the S. lividans (pANT849- 
ebh24'l6) culture, in which the epothilone B had been completely converted to 
epothilone F at 48 hours. The samples were analyzed by HPLC. The results are 
calculated as a percentage of the epothilone B at time 0 hours. 

10 Epothilone B: 



Time (hours) 


TK24 


pANT849^*li 


pANT849-«*A10-53 


pANT849-eM24-16 


0 


100% 


100% 


100% 


100% 


24 


99% 


78% 


69% 


56% 


48 


87% 


19% 


39% 


0% 


72 


87% 


0% 


3% 




Epothilone F: 


Time (hours) 


TK24 


pANT849-e»A 


pANT849-«M10-53 


pANT849-eM24-16 


0 


0% 


0% 


0% 


0% 


24 


0% 


4% 


9% 


23% 


48 


0% 


21% 


29% 


52% 


72 


0% 


14% 


41% 





The ability of cells expressing ebh to biotransform compactin to pravastatin 
15 was also examined. In these experiments, frozen spore preparations of 5. lividans 
(pANT849) or S. lividans (pANT849-efe/i) were grown for several days at SO'^C. 
Following the pre-incubation, an aliquot of each cell culture was transferred to a 
polypropylene culture tube, compactin was added to each culture tube, and the tubes 
were incubated for 24 hours. 30°C. 250 rpm. An aliquot of the culture broth was then 
20 extracted and compactin and pravastatin values relative to the control 5. lividans 
(pANT849) culture were measured via HPLC. 



Compactin and pravastatin as a percentage of starting compactin 
concentration: 





5. lividans (pANT849) 


S. lividans (pANTm-ebh) 


Compactin 


36% 


11% 


Pravastatin 


11% 


53% 
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5 As discussed supra, mutant ebh25A (SEQ ID NO:30) exhibits altered 

substrate specificity and biotransformation of epothilone B by this mutant resulted in 
a product with a different HPLC elution time than epothilone B or epothilone F. A 
sample of this unknown was analyzed by LC-MS and was found to have a molecular 
weight of 523 (M,W.), consistent with a single hydroxylation of epothilone B. The 

10 structure of the biotransformation product was determined as 24-hydroxyl-epothilone 
B, based on MS and NMR data (compared with data of epothilone B): 



26 




0 OH O 

24-hydroxyl-epothilone B 
Formula A 



15 Molecular Formula: C27H41NO7 S 
Molecular Weight: 523 

Mass Spectrum: ES+ (m/z): 524([M+H]^), 506. 
LC/MS/MS: +ESI (m/z): 524, 506, 476, 436, 320 

HRMS: Calculated for [M-i-H]^ 524.2682; Found: 524.2701 

20 HPLC (Rt) 7.3 minutes (on the analytical HPLC system) 

LC/NMR Observed Chemical Shifts 

Varian AS-600 (Proton: 599.624 MHz), 

Solvent D2O/CD3CN (5 1 .94): -4/6 

Proton: 87.30 (s, IH), 6.43 (s, IH), 5.30 (m, IH), 4.35 (m, IH), 
25 ' 3.81 (m, IH). 3.74 (m, IH), 3.68 (m, IH), 3.43 (m, IH), 2.87 

(m, IH), 2.66 (s, 3H), 2.40 (m, 2H), 1.58 (b, IH), 1.48 (b, IH), 
1.35 (m, 3H), 1.18 (s, 3H), 1.13 (s, 3H), 0.87 (m, 6H) 
*Peaks between 1.8-2.1 ppm were not observed due to solvent 
suppression. 
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The proton chemical shift was assigned as follows: 



10 



15 



20 



25 



30 



35 



Position 


Proton 


Pa 


1 


- 




2 


2.40 


m 


3 


4.35 


m 


4 


~ 




5 


— 




6 


3.43 


m 


7 


3.68 


m 


8 


1.58 


m 


9 


1.35 


b 


10 


1.48 


b 


10 


1.35 


b 


11 


SSP 




12 






13 


2.87 


m 


14 


SSP 




15 


5.30 


m 


16 






17 


6.43 


s 


18 


— 




19 


7.30 


s 


20 


— 




21 


2.66 


s 


22 


1.18 


s 


23 


0.87 


m 


24 


3.81 


m 


24 


3.74 


m 


25 


0.87 


m 


26 


1.13 


s 


27 


SSP 





"SSP: no observed due to solvent suppression. 
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5 Accordingly, the compositions and methods of the present invention are useful 

in producing known compounds that are microtubule-stabilizing agents as well as new 
compounds comprising epothilone analogs such as 24-hydroxyl-epothiione B 
(Formula A) and pharmaceutically acceptable salts thereof expected to be useful as 
microtubule-stabilizing agents. The microtubule stabilizing agents produced using 
10 these compositions and methods are useful in the treatment of a variety of cancers and 
other proliferative diseases including, but not linnited to, the following; 

carcinoma, including that of the bladder, breast, colon, kidney, liver, lung, 
ovary, pancreas, stomach, cervix, thyroid and skin; including squamous cell 
carcinoma; 

15 - hematopoietic tumors of lymphoid lineage, including leukemia, acute 
lymphocytic leukemia, acute lymphoblastic leukemia, B-cell lymphoma. T-cell 
lymphoma, Hodgkins lymphoma, non-Hodgkins lymphoma, hairy cell lymphoma and 
Burketts lymphoma; 

hematopoietic tumors of myeloid lineage, including acute and chronic 
20 myelogenous leukemias and promyelocytic leukemia; 

tumors of mesenchymal origin, including fibrosarcoma and 
rhabdomyoscarcoma; 

other tumors, including melanoma, seminoma, tetratocarcinoma, 
neuroblastoma and glioma; 
25 - tumors of the central and peripheral nervous system, including astrocytoma, 
neuroblastoma, glioma, and schwannomas; 

tumors of mesenchymal origin, including fibrosarcoma, rhabdomyosarcoma, 
and osteosarcoma; and 

other tumors, including melanoma, xenoderma pigmentosum, 
30 keratoactanthoma, seminoma, thyroid follicular cancer and teratocarcinoma. 

Microtubule stabilizing agents produced using the compositions and methods 
of the present invention will also inhibit angiogenesis, thereby affecting the growth of 
tumors and providing treatment of tumors and tumor-related disorders. Such anti- 
angiogenesis properties of these compounds will also be useful in the treatment of 
35 other conditions responsive to anti-angiogenesis agents including, but not limited to, 
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5 certain forms of blindness related to retinal vascularization, arthritis, especially 
inflammatory arthritis, multiple sclerosis, restinosis and psoriasis. 

Microtubule stabilizing agents produced using the compositions and methods 
of the present invention will induce or inhibit apoptosis, a physiological cell death 
process critical for normal development and homeostasis. Alterations of apoptotic 

10 pathways contribute-to the pathogenesis of a variety of human diseases. Compounds 
of the present invention such as those set forth in formula I and n and Formula A, as 
modulators of apoptosis, will be useful in the treatment of a variety of human diseases 
with aberrations in apoptosis including, but not limited to. cancer and precancerous 
lesions, immune response related diseases, viral infections, degenerative diseases of 

15 the musculoskeletal system and kidney disease. 

Without wishing to be bound to any mechanism or morphology, microtubule 
stabilizing agents produced using the compositions and methods of the present 
invention may also be used to treat conditions other than cancer or other proliferative 
diseases. Such conditions include, but are not limited to viral infections such as 

20 herpesvirus, poxvirus, Epstein-Barr virus, Sindbis virus and adenovirus; autoimmune 
diseases such as systemic lupus erythematosus. inMiune mediated glomerulonephritis, 
rheumatoid arthritis, psoriasis, inflammatory bowel diseases and autoimmune diabetes 
mellitus; neurodegenerative disorders such as Alzheimer's disease, AIDS-related 
dementia, Parkinson's disease, amyotrophic lateral sclerosis, retinitis pigmentosa, 

25 spinal muscular atrophy and cerebellar degeneration; AIDS; myelodysplastic 

syndromes; aplastic anemia; ischemic injury associated myocardial infarctions; stroke 
and reperfusion injury; restenosis; arrhythmia; atherosclerosis; toxin-induced or 
alcohol induced liver diseases; hematological diseases such as chronic anemia and 
aplastic anemia; degenerative diseases of the musculoskeletal system such as 

30 osteoporosis and arthritis; aspirin-sensitive rhinosinusitis; cystic fibrosis; multiple 

sclerosis; kidney diseases; and cancer pain. 

The following nonlimiting examples are provided to further illustrate the 
present invention. 
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5 EXAMPLES 

Example 1: Reagents 

R2 Medium was prepared as follows: 

A solution containing sucrose (103 grams), K2SO4 (0.25 grams) MgCl2o6H20 
10 (10.12 grams), glucose (10 grams), Difco Casaminoacids (0.1 grams) and distilled 
water (800 ml) was prepared. Eighty ml of this solution was then poured into a 200 
ml screw capped bottle containing 2.2 grams Difco Bacto agar. The bottle was 
capped and autoclaved. At time of use, the medium was remelted and the following 
autoclaved solutions were added in the order listed: 
15 1 ml KH2PO4 (0.5%) 

8 ml CaCWHjO (3.68%) 

L5 ml L-proline (20%) 

10 ml TES buffer (5.73%, adjusted to pH 7.2) 

0.2 ml Trace element solution containing ZnCl2(40mg), FeCl3o6H2O(200 mg), 
20 CUCI202H2O (10 mg), MnCl2o4H20 (10 mg), Na2B4O7ol0H2O (10 mg), and 

(NH4)6M070240H20 

0.5 ml NaOH (lN)(sterilization not required) 

0.5 ml Required growth factors for auxotrophs (Histidine (50 |ig/ml); Cysteine 
(37 |Xg/m]); adenine, guanine, thymidine and uracil (7.5 |ig/ml); and Vitamins (0.5 
25 [ig/ml). 

R2YE medium was prepared in the same fashion as R2 medium. However, 5 ml of 
Difco yeast extract (10%) was added to each 100 ml flask at time of use. 

30 P (protoplast) buffer was prepared as follows: 

A basal solution made up of the following was prepared: 

Sucrose (103 grams) 

K2SO4 (0.25 grams) 

MgCl2o6H20 (2.02 grams) 
35 Trace Element Solution as described for R2 medium (2 ml) 

Distilled water to 800 ml 
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5 Eighty ml aliquots of the basal solution were then dispensed and autoclaved. Before 
use, the following was added to each flask in the order listed: 

1 ml KH2PO4 (0.5%) 

10 ml CaCl2o2H20 (3.68%) 

TES buffer (5.75%, adjusted to pH 7.2) 
10 T (transformation) buffer was prepared by mixing the following sterile solutions: 



15 The following are then added to 9.3 mis of this solution: 
0.2 ml CaCl2 (5M) 

0.5 ml Tris maleic acid buffer prepared from 1 M solution of Tris adjusted to 
pH 8.0 by adding maleic acid. 

For use, 3 parts by volume of the above solution are added to 1 part by weight of PEG 



25 ml Sucrose (10.3%) 



75 ml distilled water 



1 ml Trace Element Solution as described for R2 medium 



1 ml K2SO4 (2.5%) 




20 1000, previously sterilized by autoclaving. 



25 



L (lysis) buffer was prepared by mixing the following sterile solutions: 
100 nil Sucrose (10.3%) 
10 ml TES buffer (5.73%, adjusted to pH 7.2) 
1 ml K2SO4 (2.5%) 



1 ml Trace Element Solution as described for R2 medium 



1 ml KH2PO4 (0.5%) 



0.1mlMgCl2o6H2O (2.5 M) 
1 ml CaCl2 (0.25 M) 
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5 CRM Medium 

A solution containing the following components was prepared in 1 liter of 
dHaO: glucose (10 grams), sucrose (103 grams), MgCl2o6H20 (10.12 grams), BBL™ 
trypticase soy broth (15 grams) (Becton Dickinson Microbiology Systems, Sparks, 
Maryland, USA), and BBL''''^ yeast extract (5 grams) (Becton Dickinson 
1 0 Microbiology Systems). The solution was autoclaved for 30 minutes. Thiostrepton 
was added to a concentration of 10 |lg/nil for cultures propagated with plasmids. 

Electroporation Buffer 

A solution containing 30% (wt/vol) PEG 1000, 10% glycerol, and 6.5 % 
15 sucrose was prepared in dH20, The solution was sterilized by vacuum filtration 
through a 0.22 ^m cellulose acetate filter. 



Example 2: Extraction of Chromosomal DNA from Strain SC 15847 

Genomic DNA was isolated from an Amycolatopsis orientalis soil isolate 
20 strain designation SC15847 (ATCC PT-1043) using a guanidine-detergent lysis 
method, DNAzol reagent (Invitrogen, Carlsbad, California, USA). The SCI 5847 
culture was grown 24 hours at 28°C in F7 medium (glucose 2.2%, yeast extract 1.0%, 
malt extract 1.0 %, peptone 0.1%, pH 7.0). Twenty ml of culture was harvested by 
centrifugation and resuspended in 20 ml of DNAzol, mixed by pipetting and 
25 centrifuged 10 minutes in the Beckman TJ6 centrifuge. Ten ml of 1(X)% ethanol was 
added, inverted several times and stored at room temperature 3 minutes. The DNA 
was spooled on a glass pipette washed in 100% ethanol and allowed to air dry 10 
minutes. The pellet was resuspended in 500 ^1 of 8mM NaOH and once dissolved it 
was neutralized with 30 ^il of IM HEPES pH7.2. 

30 

Example 3: PCR Reactions 

PGR reactions were prepared in a volume of 50 jil, containing 200-500 ng of 
genomic DNA or 1.0 ^.1 of the cDNA, a forward and reverse primer, and the forward 
primer being either P450-r (SEQ ID NO:23) or P450-la^(SEQ ID NO:24) or P450- 
35 2'^(SEQ ID NO:25) and the reverse primer P450-3" (SEQ ID NO:27)or P450-2*(SEQ 
ID NO:26). All primers were added to a final concentration of 1.4- 2.0 |xM. The PCR 
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5 reaction was prepared with 1 yd of Taq enzyme (2.5 units) (Stratagene), 5 [il of Taq 
buffer and 4 |al of 2.5 mM of dNTPs with dH20 to 50 The cycling reactions were 
performed on a Geneamp® PCR system with the following protocol: 95°C for 5 
minutes. 5 cycles [95°C 30 seconds, 3TC 15 seconds (30% ramp), 72°C 30 seconds], 
35 cycles (94°C 30 seconds, 65°C 15 seconds, 72°C 30 seconds), 72°C 7 minutes. 

10 The expected sizes for the reactions are 340 bp for the P450-r (SEQ ID NO:23) or 
P450-la^ (SEQ ID NO:24) and P450-3* (SEQ ID NO:27) primer pairs, 240 bp for the 
P450-r (SEQ ID NO:23) and P450-2- (SEQID NO:26) primer pairs and 130 bp for 
the P450-2^ (SEQ ID NO:25) and P450-3" (SEQ ID NO:27) primer pairs. 

15 Example 4: Cloning of Epothilone B Hydroxylase and Ferredoxin Genes 

Twenty )xg of SC 15847 genomic DNA was digested with Bglll restriction 
enzyme for 6 hours at 37°C. A 30k: nanosep column (Gelman Sciences, Ann Arbor, 
Michigan, USA) was used to concentrate the DNA and remove the enzyme and 
buffer. The reactions were concentrated to 40 \il and washed with 200 |il of TE. The 

20 digestion products were then separated a 0.7% agarose gel and genomic DNA in the 
range of 12-15 kb was excised from the gel and purified using the Qiagen gel 
extraction method. The genomic DNA was then ligated to plasmid pWB19N (U.S. 
Patent 5,516,679), which had been digested with BamHI and dephosphorylated using 
the SAP I enzyme (Roche Molecular Biochemicals, Indianapolis, Indiana, catalog#l 

25 758 250). Ligation reactions were performed in a 15 |il volume with lU of T4 DNA 
ligase (Invitrogen) for 1 hour at room temperature. One |il of the ligation was 
transformed to 100 [il of chemically competent DHIOB cells (Invitrogen) and 100 [il 
plated to five LB agar plates with 30 ^ig/ml of neomycin, 37°C overnight. 

Five nylon membrane circles (Roche Molecular Biochemicals, Indianapolis, 

30 Indiana) were numbered and marked for orientation. The membranes were placed on 
the plates 2 minutes and then allowed to dry for 5 minutes. The membranes were then 
placed on Whatman filter disks saturated with 10% SDS for 5 minutes, 0.5N NaOH 
with 1.5 M NaCl for 5 minutes, 1.5 M NaCl with 1.0 M Tris pH 8.0 for 5 minutes, 
and 15 minutes on 2X SSC. The filters were hybridized as described previously for 

35 the Southern hybridization. Hybridizing colonies were picked to 2 ml of TB with 30 
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5 |lg/rril neomycin and grown overnight at ZTC Plasmid DNA was isolated using a 
miniprep column procedure (Mo Bio). This plasmid was named NPB29-1. 

Example 5: DNA Sequencing and Analysis 

The cloned PCR products were sequenced using fluorescent-dye-labeled 
10 terminator cycle sequencing, Big-Dye sequencing kit (Applied Biosystems, Foster 
city, California, USA) and were analyzed using laser-induced fluorescence capillary 
electrophoresis, ABI Prism 310 sequencer (Applied Biosystems). 

Example 6: Extraction of Total RNA 

15 Total RNA was isolated from the SC 15847 culture using a modification of the 

Chomczynski and Sacchi method with a mono-phasic solution of phenol and 
guanidine isothiocyanate, Trizol reagent (Invitrogen). Five ml of an SCI 5847 frozen 
stock culture was thawed and used to inoculate 100 ml of F7 media in a 500 ml 
Erlenmeyer flask. The culture was grown in a shaker incubator at 230 rpm, 30*^C for 

20 20 hours to an optical density at 600 nm (ODeoo) of 9.0. The culture was placed in a 
16°C shaker incubator at 230 rpm for 20 minutes. Fifty-five milligrams of epothilone 
B was dissolved in 1 ml of 100% ethanol and added to the culture. A second ml of 
ethanol was used to rinse the residual epothilone B from the tube and added to the 
culture. The culture was incubated at 16°C, 230 rpm for 30 hours. Thirty ml of the 

25 culture was transferred to a 50 ml tube, 150 mg of lysozyme was added to the culture 
and the culture was incubated 5 minutes at room temperature. Ten ml of the culture 
was placed in a 50 ml Falcon tube and centrifuged 5 minutes, 4°C in a TJ6 centrifuge. 
Two ml of chloroform was added and the tube was mixed vigorously for 15 seconds. 
The tube was incubated 2 minutes at room temperature and centrifuged 10 minutes, 

30 top speed in the TJ6 centrifuge. The aqueous layer was transferred to a fresh tube and 
2.5 ml of isopropanol was added to precipitate the RNA. The tube was incubated 10 
minutes at room temperature and centrifuged 10 minutes, 4°C. The supernatant was 
removed, the pellet was rinsed with 70% ethanol and dried briefly under vacuum. The 
pellet was resuspended in 150 ^ll of RNase-free dHaO. Fifty |il of 7.5M LiCl was 

35 added to the RNA and incubated at -20°C for 30 minutes. The RNA was pelleted by 
centrifugation 10 minutes, 4°C in a microcentrifuge. The pellet was rinsed with 200 jil 
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5 of 70% ethanol, dried briefly under vacuum and resuspended in 1 50 |il of RNase free 
dH20. 

The RNA was treated with DNasel (Ambion, Austin, Texas, USA). Twenty- 
five |il of total RNA (5,3 ing/^l), 2.5 [i\ of DNasel buffer, 1.0 |il of DNase I added 
and incubated at 3TC for 25 minutes. Five |ll of DNase I inactivation buffer added, 
10 incubated 2 minutes, centrifiiged 1 minute, the supernatant was transferred to a fresh 
tube. 



Example 7: cDNA Synthesis 

cDNA was synthesized from the total RNA using the Superscript n enzyme 
15 (Invitrogen). The reaction was prepared with 1 pJ of total RNA (5.3 M-g/jil), 9 |lU of 
dH20, 1 |il of dNTP mix (10 mM), and 1 pJ of random hexamers. The reaction was 
incubated at 65*^C for 5 minutes then placed on ice. The following components were 
then added: 4 pJ of 1'* strand buffer, 1 pi of RNase Inhibitor, 2.0 pi of 0.1 M DTT, 
and 1 pi of Superscript 11 enzyme. The reaction was incubated at room temperature 10 
20 minutes, 42°C for 50 minutes and 70°C for 15 minutes. One pi of RNaseH was added 
and incubated 20 minutes at 37°C, 15 minutes at 70°C and stored at 4°C. 



Example 8: DNA Labeling 

The PGR conditions used to amplify the P450 specific products from genomic 
25 DNA and cDNA were used to amplify the insert of plasmid pCRscript-29. Plasmid 
pCRscript-29 contains a 340bp PGR fragment amplified from SC15847 genomic 
DNA using primers P450 l''(SEQ ID NO:23) and P450 3" (SEQ ID NO:27). Two pi 
of the plasmid prep was used as a template, with a total of 25 cycles. The amplified 
product was gel purified using the Qiaquick gel extraction system (Qiagen). The 
30 extracted DNA was ethanol precipitated and resuspended in 5 p.! of TE, the yield was 
estimated to be 500 ng. This fragment was labeled with digoxigenin using the chem 
link labeling reagent (Roche Molecular Biochemicals, Indianapolis, Indiana catalog 
#1 836 463). Five pi of the PGR product was mixed with 0.5 pJ of Dig-chem link and 
dH20 added to 20 pi. The reaction was incubated 30 minutes at 85°C and 5 |il of stop 
35 solution added. The probe concentration was estimated at 20 ng/pl. 
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5 

Example 9: Southern DNA Hybridization 

Ten fil of genomic DNA (0.5 ^g/|il) was digested with BamHI, Bgin, EcoRI, 
HindEI or NotI and separated at 12 volts for 16 hours. The gel was depurinated 10 
minutes in 0.25 N HCl and transferred by vacuum to a nylon membrane (Roche 

10 Molecular Biochemicals) in 0.4 N NaOH 5" Hg , 90 minutes using a vacuum blotter 
(Bio-Rad Laboratories, Inc. Hercules, California, USA catalog # 165-5000). The 
membrane was rinsed in 1 M ammonium acetate and UV-crosslinked using the 
Stratalinker UV Crosslinker (Stratagene). The membrane was rinsed in 2X SSC and 
stored at room temperature, 

15 The membrane was prehybridized 1 hour at 42°C in 20 ml of Dig Easy Hyb 

buffer (Roche Molecular Biochemicals). The probe was denatured 10 minutes at 65°C 
and then placed on ice. Five ml of probe in Dig-Easy Hyb at an approximate 
concentration on 20 ng/ml was incubated with the membrane at 42°C overnight. The 
membrane was washed 2 times in 2X SCC with 0,1% SDS at room temperature, then 

20 2 times in 0.5X SSC with 0. 1 % SDS at 65°C. The membrane was equilibrated in 
Genius buffer 1 (10 mM maleic acid, 15 mM NaCl; pH 7.5; 0.3% v/v Tween 20) 
(Roche Molecular Biochemicals, Indianapolis, Indiana) for 2 minutes, then incubated 
with 2% blocking solution (2% Blocking reagent in Genius Buffer l)(Roche 
Molecular Biochemicals Indianapolis, Indiana) for 1 hour at room temperature. The 

25 membrane was incubated with a 1:20,000 dilution of anti-dig antibody in 50 ml of 
blocking solution for 30 minutes. The membrane was washed 2 times, 15 minutes 
each in 50 ml of Genius buffer 1. The membrane was equilibrated for two minutes in 
Genius Buffer 3 (lOmM Tris-HCl. lOmM NaCl; pH 9.5), One ml of a 1 :100 dilution 
of CSPD (disodium 3-(4-methoxyspiro{ l,2-dioxetane-3,2'-(5'- 

30 chloro)tricyclo[3.3.1.1^*^]decan)-4-yl)phenyl phosphate) (Roche Molecular 
Biochemicals) in Genius buffer 3 was added to the membrane and incubated 5 
minutes at room temperature, then placed at 3TC for 15 minutes. The membrane was 
exposed to Biomax ML film (Kodak, Rochester, New York, USA) for 1 hour. 

35 Example 10: E, coli Transformation 
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5 Competent cells were purchased from Invitrogen. £. coli strain DHIOB was 

used as a host for genomic cloning. The chemically competent cells were thawed on 
ice and 100 pi aliquoted to a 17 x 100-mm polypropylene tube on ice. One |xl of the 
ligation mixture was added to the cells and incubated on ice for 30 minutes. The cells 
were incubated at 42°C for 45 seconds, then placed on ice 1-2 minutes. 0.9 ml pf 

10 SOC. medium(Invitrogen) was added and the cells were incubated one hour at 30- 
37°C at 200-240 rpm. Cells were plated on a selective medium (Luria agar with 
neomycin or ampicillin at a concentration of 30 |ig /ml or 100 |Lig /ml respectively). 

Example 11: Transformation of Streptomyces lividans TK24 

15 Plasmid pWB19N849 was constructed by digesting plasmid pWB19N with 

Hindin and treating with SAP I and digesting plasmid pANT849 (Keiser, et al., 2000, 
Practical Streptomyces Genetics, John Innes ) with Hindin. The two linearized 
fragments were ligated 1 hour at room temperature with lU of T4 DNA ligase. One jil 
of the ligation reaction was used to transform XL-1 Blue electrocompetent cells 

20 (Stratagene). The recovered cells were plated to LB neomycin (30 |Xg/ml) overnight at 
37X. Colonies were picked to 2 ml of LB with 30 ^lg/ml neomycin and incubated 
overnight at 30°C. MoBio plasmid minipreps were performed on all cultures. 
Plasmids constructed from the ligation of pWB19N and pANT849 were determined 
by electrophoretic mobility on 0.7 % agarose. The plasmid pWB19N849 was digested 

25 with Hindin and BgllT to excise a 5.3 kb fragment equivalent to plasmid pANT849 
digested with Bgin and HindlQ. This 5.3 kb fragment was purified on an agarose gel 
and extracted using the Qiaquick gel extraction system. 

A 1.469 kb DNA fragment containing the epothilone B hydroxylase gene and 
the downstream ferredoxin gene was amplified using PCR. The 50 ^1 PCR reaction 

30 was composed of 5 jil of Taq buffer, 2.5 \i\ glycerol. 1 pi of 20 ng/tU NPB29-1 

plasmid, 0.4 [il of 25 mM dNTPs, 1.0 \xl each of primers NPB29-6F (SEQ ID NO:28) 
and NPB29-7R (SEQ ID NO:29) (5 pmole/^1), 38.1 ^1 of dH20 and 0.5 [ll of Taq 
enzyme (Stratagene). The reactions were performed on a Perkin Elmer 9700, 95*^C for 
5 minutes, then 30 cycles (96°C for 30 seconds, 60°C 30 seconds, 72°C for 2 

35 minutes), and 72X for 7 minutes. The PCR product was purified using a Qiagen 
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5 minieiute column with the PGR cleanup procedure. The purified product was digested 
with Bglll and HindlH and purified on a 0.7 % agarose gel. A 1.469 kb band was 
excised from the gel and eluted using a Qiagen minieiute column. Five yd of this PGR 
product was ligated with 2 ^1 of the BglTI, HindlH digested pANT849 vector in a 10 
\il ligation reaction. The reaction was incubated at room temperature for 24 hours and 

10 then transformed to 5. lividans TK24 protoplasts. 

Twenty ml of YEME media was inoculated with a frozen spore suspension of 
5. lividans TK24 and grown 48 hours in a 125 ml bi-indent flask. Protoplasts were 
prepared as described in Practical Streptomyces Genetics. The ligation reaction was 
mixed with protoplasts, then 500 ^1 of transformation buffer was added, followed 

15 immediately by 5 ml of P buffer. The transformation reactions were spun down 7 
minutes at 2,750 rpm, resuspended in 100 |il of P buffer and plated to one R2YE 
plate. The plate was incubated at 28°C for 20 hours then overlaid with 5 ml of LB 
0.7% agar with 250 P-g/ml thiostrepton. After 7 days colonies were picked to an 
R2YE grid plate with 50 |Xg/ml of thiostrepton. The colonies were grown an 

20 additional 5 days at 28°G, then stored at 4*'G. 

This recombinant microorganism has been deposited with the ATGG and 
designated PTA-4022. 

Example 12: Transformation oi Streptomyces rimosus 
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5 The procedure of Pigac and Schrempf Appl. Environ Microb., Vol. 61, No, 1, 

352-356 (1995) was used to transform S. rimosus, S. rimosus strain R6 593 was 
cultivated in 20 ml of CRM medium at 30 °C on a rotary shaker (250 rpm). The cells 
were harvested at 24 hrs by centrifugation for 5 minutes, 5,000 rpm, 4°C, and 
resuspended in 20 ml of 10% sucrose, 4 ^C, and centrifuged for 5 minutes, 5,000 rpm, 

10 4 **C. The pellet was resuspended in 10 ml of 15% glycerol, 4 °C and centrifuged for 5 
minutes, 5,000 rpm, 4 °C. The pellet was resuspended in 2 ml of 15% glycerol, 4 °C 
with 100 |ig/ml lysozyme and incubated at 37 °C for 30 minutes, centrifuged for 5 
minutes, 5,000 rpm, 4 and resuspended in 2 ml of 15% glycerol, 4 °C. The 15% 
glycerol wash was repeated once and the pellet was resuspended in 1 to 2 ml of 

15 Electroporation Buffer. The cells were stored at -80°C in 50 - 200 Jil aliquots. 

The ligations were prepared as described for the 5, lividans transformation. 
After the incubation of the ligation reaction, the volume was brought to 100 |xl with 
dH20, NaCl was added to 0.3M, and the reaction extracted with an equal volume of 
24:1:1 phenohchoroform isoamyl alcohol. Twenty |xg of glycogen was added and the 

20 ligated DNA was precipitated with 2 volumes of 100% ethanol at -20 °C for 30 

minutes. The DNA was pelleted 10 minutes in a microcentrifuge, washed once with 
70% ethanol, dried 5 minutes in a speed-vac concentrator and resuspended in 5 |xl of 
dH20. 

One frozen aliquot of cells was thawed at room temperature and divided, 50 
25 \\M tube for each DNA sample for electroporation. The cells were stored on ice until 
use. DNA in 1 to 2 ^1 of dH20 was added and mixed. The cell and DNA mixture was 
transferred to a 2 mm gapped electrocuvette (Bio-Rad Laboratories, Richmond 
California USA) that was pre-chilled on ice. The cells were electroporated at a setting 
of 2 kV (lOkV/cm), 25^F, 400 Q using a Gene Pulser™ (Bio-Rad Laboratories). The 
30 cells were diluted with 0.75 to LO ml of CRM (0-4 °C), transferred to 15 ml culture 
tubes and incubated with agitation 3hrs at 30 °C. The cells were plated on trypticase 
soy broth agar plates with 10-30 |ig/ml of thiostrepton. 

Example 13; High Performance liquid chromatography 

35 The liquid chromatography separation was performed using a Waters 2690 

Separation Module system (Waters Corp.. Milford, MA, USA) and a column, 4.6 x 
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5 150 mm, filled with SymmetryShield RPg, particle size 3.5 ^lm (Waters Corp., 

Milford, MA, USA). The gradient mobile phase programming was used with a flow 
rate of 1 .0 ml/minute. Eluent A was water/acetonitrile (20:1) + 10 mM ammonium 
acetate. Eluent B was acetonitrile/water (20:1). The mobile phase was a linear 
gradient from 12% B to 28 % B over 6 minutes and held isocratic at 28% B over 4 

10 minutes. This was followed by a 28% B to 100% B linear gradient over 20 minutes 
and a linear gradient to 12% B over two minutes with a 3 minute hold at 12% B. 

Example 14: Mass spectrometry 

The column effluent was introduced directly into the electrospray ion source of a 
15 ZMD mass spectrometer (Micromass, Manchester, UK). The instrument was calibrated 
using Test Juice reference standard (Waters Corp, Milford, MA, USA) and was delivered 
at a flow of 10 jjj/minute from a syringe pump (Harvard Apparatus, Holliston, MA, 
USA). The mass spectrometer was operated at a low mass resolution of 13.2 and a high 
mass resolution of 1 1.2. Spectra were acquired from using a scan range of m/z 100 to 
20 600 at an acquisition rate of 10 spectra /second. The ionization technique employed was 
positive electrospray (ES). The sprayer voltage was kept at 2900 V and the cone voltage 
of the ion source was kept at a potential of 17 V. 

Example 15: Use of the ebh gene sequence (SEQ ID N0:1) to isolate cytochrome 
25 P450 genes from other microorganisms 

Genomic DNA was isolated from a set of cultures (ATCC43491, 
ATCC 14930, ATCC53630. ATCC53550, ATCC39444, ATCC43333, ATCC35165) 
using the DNAzol reagent. The DNA was used as a template for PCR reactions using 
primers designed to the sequence of the ehh gene. Three sets of primers were used for 

30 amplification; NPB29-6f (SEQ ID NO:28) and NPB29-7r (SEQ ID NO:29). NPB29- 
16f (SEQ ID NO:50) and NPB29-17r (SEQ ID N0:51), and NPB29-19f (SEQ ID 
NO:52) and NPB29-20r (SEQ ID NO:53). 

PCR reactions were prepared in a volume of 20 |Xl, containing 200-500 ng of 
genomic DNA and a forward and reverse primer. All primers were added to a final 

35 concentration of 1.4- 2.0 1.1M. The PCR reaction was prepared with 0.2 jil of 

Advantage™ 2 Taq enzyme (BD Biosciences Clontech, Palo Alto. California, USA), 
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5 2 ^1 of Advantage™ 2 Taq buffer and 0.2 ^1 of 2.5 mM of dNTPs with dHsO to 20 
The cycling reactions were performed on a Geneamp® 9700 PGR system or a 
Mastercycler® gradient (Eppendorf, Westbury, New York, USA) with the following 
protocol: 95°C for 5 minutes, 35 cycles (96^C 20 seconds, 54-69T 30 seconds, 72°C 
2 minutes), 72*'C for 7 minutes. The expected size of the PGR products is 

10 approximately 1469 bp for the NPB29-6f (SEQ ID NO:28) and NPB29-7r (SEQ ID 
NO:29) primer pair, 1034 bp for the NPB29-16f (SEQ ID NO:50) and NPB29-17r 
(SEQ ID N0:51) primer pair and 13 18 bp for the NPB29-19f (SEQ ID NO:52) and 
NPB29-20r (SEQ ID NO:53) primer pair. The PGR reactions were analyzed on 0.7% 
agarose gels. PGR products of the expected size were excised from the gel and 

15 purified using the Qiagen gel extraction method. The purified products were 
sequenced using the Big-Dye sequencing kit and analyzed using an ABI310 
sequencer. 
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5 Example 16: Construction of plasmid pPCRscript-ei/i 

A 1.469 kb DNA fragment containing the epothilone B hydroxylase gene and 
the downstream ferredoxin gene was ampHfied using PGR. The 50 \jd PGR reaction 
was composed of 5 fil of Taq buffer. 2.5 |Lil glycerol, 1 ^ti of 20 ng/|al NPB29-1 
plasmid, 0.4 ^1 of 25 mM dNTPs, 1.0 each of primers NPB29-6f (SEQ ID NO:28) 

0 and NPB29-7r (SEQ E) NO:29) (5 pmole/^tl), 38. 1 ^1 of dH20 and 0.5 of Taq 
enzyme (Stratagerie). The reactions were performed on a Geneamp 9700 PGR 
system, with the following conditions; 95°C for 5 minutes, then 30 cycles (96°G for 
30 seconds, 60''C 30 seconds, 72X for 2 minutes), and 72°C for 7 minutes. The PGR 
product was purified using a Qiagen Qiaquick column with the PGR cleanup 

5 procedure. The purified product was digested with Bglll and HindlH and purified on a 
0.7 % agarose gel. A 1.469 kb band was excised from the gel and eluted using a 
Qiagen Qiaquick gel extraction procedure. The fragments were then cloned into the 
pPGRscript Amp vector using the PCRscript Amp cloning kit. Golonies containing 
inserts were picked to 1-2 ml of LB (Luria Broth) with 100 |ig/ml ampicillin, 30- 

0 37°G, 16-24 hours, 230-300 rpm. Plasmid isolation was performed using the Mo Bio 
miniplasmid prep kit. The sequence of the insert was confirmed by cycle sequencing 
with the Big-Dye sequencing kit. This plasmid was named pPGRscript-efc/i. 

Example 17: Mutagenesis of the ebh gene for improved yield or altered 
5 specificity 

The Quikchange® XL Site-Directed Mutagenesis Kit and the Quikchange® 
Multi Site-Directed Mutagenesis kit, both from Stratagene were used to introduce 
mutations in the coding region of the ebh gene. Both of these methods employ DNA 
primers 35-45 bases in length containing the desired mutation (SEQ E) NO:54-59 and 

0 77), a methylated circular plasmid template and PfuTurbo® DNA Polymerase (U.S. 
Patent Nos 5,545,552 and 5,866,395 and 5,948,663) to generate copies of the plasmid 
template incorporating the mutation carried on the mutagenic primers. Subsequent 
digestion of the reaction with the restriction endonuclease enzyme Dpnl, selectively 
digests the methylated plasmid template, but leaves the non-methylated mutated 

5 plasmid intact. The manufacturer's instructions were followed for all procedures with 
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5 the exception of the Dpnl digestion step in which the incubation time was increased 
from 1 hr to 3 hrs. The pPCRscript-ei/z vector was used as the template for 
mutagenesis. 

One to two \Jil of the reaction was transformed to either XL 1 -Blue® 
electrocompetent or XLIO-Gold® ultracompetent cells (Stratagene). Cells were plated 

10 to a density of greater than 100 colonies per plate on LA (Luria Agar) 100 |ig/ml 
ampicillin plates, and incubated 24-48 hrs at 30-37°C. The entire plate was 
resuspended in 5 ml of LB containing 100 P-g/ml ampicillin. Plasmid was isolated 
directly from the resuspended cells by centrifuging the cells and then purifying the 
plasmid using the Mo Bio miniprep procedure. This plasmid was then used as a 

15 template for PGR with primers NPB29-6f (SEQ ID NO:28)and NPB29-7r (SEQ ID 
NO:29) to amphfy a mutated expression cassette. Digestion of the 1.469 kb PGR 
product with the restriction enzymes Bgin and HindHI was used to prepare this 
fragment for ligation to vector pANT849 also digested with Bgin and Hindlll. 
Alternatively, the resuspended cells were used to inoculate 20- 50 ml of LB 

20 containing 100 |ig/ml ampicillin and grown 18-24 hrs at 30-37°G. Qiagen midi-preps 
were performed on the cultures to isolate plasmid DNA containing the desired 
mutation. Digestion with the restriction enzymes Bgin and HindU was used to excise 
the mutated expression cassette for ligation to Bgin and HindHI digested plasmid 
pANT849. Screening of mutants was performed in S. lividans or 5. rimosus as 

25 described. 

Alternatively, the method of Leung et ai, Technique - A Journal of Methods in 
Cell and Molecular Biology , Vol. 1, No. 1, 1 1-15 (1989) was used to generate random 
mutation libraries of the ebh gene. Manganese and/or reduced dATP concentration is 
used to control the mutagenesis frequency of the Taq polymerase. The plasmid 

30 pGRscript-^i/i was digested with NotI to linearize the plasmid. The Polymerase buffer 
was prepared with 0.166 M (NH4)2S04, 0.67M Tris-HCl pH 8.8, 61 mM MgGlz, 67 
|XM EDTA pH8.0, 1.7 mg/ml Bovine Serum Albumin). The PGR reaction was 
prepared with 10 14-1 of Not 1 digested pGRscript-ei?/i (0. Ing/^il), 10 of polymerase 
buffer. 1.0 ^1 of IM P-mercaptoethanol. 10.0 ^1 of DMSO, 1.0 ^U of NPB29-6f (SEQ 

35 ID NO:28) primer (100 pmole/pJ), 1 .0 \i\ of NPB29-7r (SEQ ID NO:29) primer (100 
pmole/|Lll), 10 ^il of 5 mM MnClz, 10.0 ^il 10 mM dOTP, 10.0 ^il 2 mM dATP, 10 mM 
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5 dTTP, 10.0 fil 10 mM dCTP, and 2.0 |Lll Taq polymerase. dH20 was added to 100 ^U. 
Reactions were also prepared as described above but without MnClj. The cycling 
reactions were performed a GeneAmp® PGR system with the following protocol: 
95''C for 1 minute, 25-30 cycles (94 °C for 1 minute, 55 ''C for 30 seconds, 72 ""C for 4 
minutes), 72 for 7 minutes. The PGR reactions were separated on an agarose gel 

10 using a Qiagen spin column. The fragments were then digested with Bglll and Hindlll 
and purified using a Qiagen spin column. The purified fragments were then ligated to 
Bgin and HindHI digested pANT849 plasmids. Screening of mutants was performed 
in 5. lividans and 5. rimosus, 

15 Table of Characterized Mutants 



Mutant 


Position 


Substitution 


Wild-type 


ebh2A-\6 


92 


Valine 


Isoleucine 




237 


Alanine 


Phenylalanine 


ebhlSA 


195 


Serine 


Asparagine 




294 


Proline 


Serine 


ebh\0'5^ 


190 


Tyrosine 


Phenylalanine 




231 


Arginine 


Glutamic acid 




92 


Valine 


Isoleucine 




237 


Alanine 


Phenylalanine 




67 


Glutamine 


Arginine 


^M24-16cll 


92 


Valine 


Isoleucine 




93 


Glycine 


Alanine 




237 


Alanine 


Phenylalanine 




365 


Threonine 


Isoleucine 


ebh24'\6A6 


92 


Valine 


Isoleucine 




106 


Alanine 


Valine 




237 


Alanine 


Phenylalanine 


eM24-16-74 


88 


Histidine 


Arginine 




92 


Valine 


Isoleucine 




237 


Alanine 


Phenylalanine 


ebh-Mli 


31 


Lysine 


Glutamic acid 




176 


Valine 


Methionine 


ebh24A6gS 


92 


Valine 


Isoleucine 
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237 Alanine Phenylalanine 

67 Glutaniine Arginine 

1 30 Threonine Isoleucine 

176 Alanine Methionine 



ebh24-\6h9 



92 

237 

67 

140 

176 



Valine 

Alanine 

Glutamine 

Threonine 

Serine 



Isoleucine 

Phenylalanine 

Arginine 

Alanine 

Methionine 



Example 18: Comparison of epothilone B transformation in cells expressing ebh 
and mutants thereof 

In these experinnents, twenty ml of YEME medium in a 125 ml bi-indented 
10 flask was inoculated with 200 of a frozen spore preparation of 5. lividans TK24, S, 
lividans (pANT849-ei?/i), S. lividans (pANT849-et;i 10-53) or S, lividans (pANT849- 
ebhlA-\6) and incubated 48 hours at 230 rpm, 30°C. Thiostrepton, 10 |ig/ml was 
added to media inoculated with 5. lividans (pANT849-eM), 5. lividans (pANT849- 
eMlO-53) and 5. lividans (pANT849-eM24-16). Four ml of culture was transferred to 
15 20 ml of R5 medium in a 125 ml Erlenmeyer flask and incubated 18 hrs at 230 rpm, 
30°C. Epothilone B in 100% EtOH was added to each culture to a final concentration 
of 0.05% weight/volume. Samples were taken at 0, 24, 48 and 72 hours with the 
exception of the S, lividans (pANT849-efe/i24-16) culture, in which the epothilone B 
had been completely converted to epothilone F at 48 hours. The samples were 
20 analyzed by HPLC. Results were calculated as a percentage of the epothilone B at 
time 0 hours. 
Epothilone B: 



Time (hours) 


TK24 


pANT849-eA/i 


pA^^T849-€*A10-53 


pANT849-e*A24.16 


0 


100% 


100% 


100% 


100% 


24 


99% 


78% 


69% 


56% 


48 


87% 


19% 


39% 


0% 


72 


87% 


0% 


3% 





Epothilone F: 

Time (hours) TK24 pKmU^ehh pANT849-ei^/i 10-53 pANT849-«i>/r 24-16 
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0 0% 0% 0% 0% 

24 0% 4% 9% 23% 

48 0% 21% 29% 52% 

72 0% 14% 41% 



Alternatively, the bioconversion of epothilone B to epothilone F was 
performed in S. rimosus host cells transformed with expression plasmids containing 
the ehh gene and its variants or mutants. One-hundred pJ of a frozen S. rimosus 
transformant culture was inoculated to 20 ml CRM media with 10 |ig/nil thiostrepton 
10 and cultivated 16-24 hr, 30°C, 230- 300 rpm. Epothilone B in 100% ethanol was 
added to each culture to a final concentration of 0.05% weight/volume. The reaction 
was typically incubated 20- 40hrs at 30 ""C, 230-300 rpm. The concentration of 
epothilones B and F was determined by HPLC analysis. 



Evaluation of mutants in 5. rimosus 



Mutant 


Epothilone F yield 


ebh-Ml8 


55% 


eM24-16d8 


75% 


eZ?/z24-16cll 


75% 


ei/j24-16-16 


75% 


ebh24-16-14 


75% 


ebh2A-\6h9 80% 


efe/i24-16g8 


85% 



Example 19: Biotransformation of compactin to pravastatin 

Twenty ml of R2YE media with 10 ^ig/ml thiostrepton in a 125 ml flask was 
inoculated with 200 [il of a frozen spore preparation of 5. lividans (pANT849), S, 
lividans (pANT849-eZ7/() and incubated 72 hours at 230 rpm, 28°C. Four ml of culture 
20 was inoculated to 20 ml of R2YE media and grown 24 hours at 230 rpm, 28°C. One 
ml of culture was transferred to a 15 ml polypropylene culture tube, 10 yl of 
compactin (40 mg/ml) was added to each culture and incubated for 24 hours, 28°C, 
250 rpm. Five hundred [il of the culture broth was transferred to a fresh 15 ml 
polypropylene culture tube. Five hundred |il of 50 mM sodium hydroxide was added 
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5 and vortexed. Three ml of methanol was added and vortexed, the tube was 

centrifuged 10 nfiinutes at 3000 rpm in a TJ-6 table-top centrifuge. The organic phase 
was analyzed by HPLC. Compactin and pravastatin values were assessed relative to 
the control S. lividans (pANT849) culture. 



10 Compactin and Pravastatin as a Percentage of Starting Compactin 
Concentration: 





S. lividans (pANT849) 


S. lividans (pANT849-eM) 


Compactin 


36% 


11% 


Pravastatin 


11% 


53% 



Example 20: High performance liquid chromatography method for compactin and 
pravastatin detection 

15 The liquid chromatography separation was performed using a Hewlett 

Packardl090 Series Separation system (Agilent Technologies, Palo Alto, California, 
USA) and a column, 50x46 mm, filled with Spherisorb 0DS2, particle size 5 \xm 
(Keystone Scientific, Inc, Bellefonte, Pennsylvania, USA). The gradient mobile 
phase programming was used with a flow rate of 2.0 ml/minute. Eluent A was water, 

20 10 mM ammonium acetate and 0.05% Phosphoric Acid. Eluent B was acetonitrile. 
The mobile phase was a linear gradient from 20% B to 90 % B over 4 minutes. 

Example 21: Structure determination of the biotransformation product of 
mutant ebhlS-l 

25 Analytical HPLC was performed using a Hewlett Packard 1 100 Series Liquid 

Chromatograph with a YMC Packed ODS-AQ column, 4.6 nun i.d. x 15 cm 1. A 
gradient system of water (solvent A) and acetonitrile (solvent B) was used: 20% to 
90% B linear gradient, 10 minutes; 90% to 20% linear gradient, 2 minutes. The flow 
rate was 1 ml/minute and UV detection was at 254 nm, 

30 Preparative HPLC was performed using the following equipment and 

conditions: 

Pump: Varian ProStar Solvent Delivery Module (Varian Inc., Palo Alto, California, 
USA). Detector: Gynkotek UVD340S. 

Column: YMC ODS-A colunm (30mmID X 100 mm length, 5p, Dparticle size). 
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5 Elution flow rate: 30 ml/minute 

Elution gradient: (solvent A: water; solvent B: acetonitrile), 20% B, 2 minutes; 20% 
to 60% B linear gradient, 18 minutes; 60% B, 2 minutes; 60 % to 90% B linear 
gradient, 1 minute; 90 % B, 3 minutes; 90 % to 20% B linear gradient, 2 minutes. 
Detection: UV, 210nm. 

10 LC/NMR was performed as follows: 40 |ll of sample was injected onto a 

YMC Packed ODS-AQ column (4.6 mm i.d. x 15 cm 1). The column was eluted at 1 
ml/minute flow rate with a gradient system of D2O (solvent A) and acetonitrile-ds 
(solvent B): 30% B, 1 nndnute; 30% to 80% B linear gradient, 11 minutes. The eluent 
passed a UV detection cell (monitored at 254 nm) before flowing through a F19/H1 

15 NMR probe (60 ^1 active volume) in Varian AS-600 NMR spectrometer. The 
biotransformation product was eluted at around 7.5 minutes and the flow was stopped 
manually to allow the eluent to remain in the NMR probe for NMR data acquisition. 

Isolation and analysis was performed as follows. The butanol/methanol extract 
(about 10 ml) was evaporated to dryness under nitrogen stream. One ml methanol 

20 was added to the residue (38 mg) and insoluble material was removed by 

centrifugation (13000 rpm. 2 min). 0.1 ml of the supernatant was used for LC/NMR 
study and the rest of 0.9 ml was subjected to the preparative HPLC (0.2-0.4 ml per 
injection). Two major peaks were observed and collected: peak A was eluted between 
14 and 15 minutes, while peak B was eluted between 16.5 and 17.5 minutes. 

25 Analytical HPLC analysis indicated that peak B was the parent compound, epothilone 
B (Rt 8.5 minutes), and peak A was the biotransformation product (Rt 7.3 minutes). 
The peak A fractions were pooled and MS analysis data was obtained with the pooled 
fractions. The pooled fraction was evaporated to a small volume, then was 
lyophilized to give 3 mg of white solid. NMR and HPLC analysis of the white solid 

30 (dissolved in methanol) revealed that the biotransformation product was partially 
decomposed during the drying process. 
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5 APPENDIX 1 



Atom No. 


Residue 


Atom Nam 


X-coord 


Y-coord 


Z-coord 


1 


ALA9 


N 


-39.918 


-4.913 


-1.651 


2 


ALA9 


CA 


-38-454 


-5.033 


-1.537 


3 


ALA9 


C 


-37.953 


-4.886 


•0.099 


4 


ALA9 


0 


-38.625 


-4.31 


0.765 


5 


ALA9 


CB 


-37.809 


-3.967 


-2.415 


6 


THR10 


N 


-36.781 


-5.447 


0.146 


7 


THR10 


CA 


-36.187 


-5:437 


1.49 


8 


THR10 


C 


-34.916 


-4.585 


1.553 


9 


THR10 


0 


-34.016 


-4.735 


0.72 


10 


THR10 


CB 


-35.871 


-6.887 


1.846 


11 


THR10 


0G1 


-37.075 


-7.631 


1.717 


12 


THR10 


CG2 


-35.355 


-7.053 


3.271 


13 


LEU11 


N 


-34.858 


-3.699 


2.536 


14 


LEU11 


CA 


-33.669 


-2.853 


2.745 


15 


LEU11 


C 


-32.511 


-3.649 


3.353 


16 


LEU11 


0 


-32.706 


-4.468 


4.259 


17 


LEU11 


CB 


-34.033 


-1.707 


3.687 


18 


LEU11 


CG 


-35.079 


-078 


3.078 


19 


LEU 11 


CD1 


-35.53 


0.265 


4.091 


20 


LEU11 


CD2 


-34.555 


-0.111 


1.81 


21 


PR012 


N 


-31.32 


-3.422 


2.823 


22 


PR012 


CA 


-30.121 


-4.119 


3.302 


23 


PR012 


C 


-29.652 


-3.606 


4.663 


24 


PR012 


0 


-29.656 


-2.397 


4.918 


25 


PR012 


CB 


-29.081 


-3.842 


2.259 


26 


PR012 


CG 


-29.597 


-2.771 


1.309 


27 


PR012 


CD 


-31.031 


-2.493 


1.729 


28 


LEU13 


N 


-29.278 


-4.522 


5.54 


29 


LEU13 


CA 


-28.676 


-4.118 


6.819 


30 


LEU13 


C 


-27.183 


-3.88 


6.627 


31 


LEU13 


0 


-26.449 


-4.806 


6.267 


32 


LEU13 


CB 


-28.898 


-5.196 


7.872 


33 


LEU 13 


CG 


-30.374 


-5.354 


8.217 


34 


LEU13 


GDI 


-30.587 


-6.516 


9.181 


35 


LEU 13 


CD2 


-30.945 


-4.067 


8.802 


36 


ALjM4 


N 


-26.72 


-2,741 


7.112 


37 


ALA14 


CA 


-25.355 


-2.266 


6.825 


38 


ALA14 


C 


-24.244 


-2.941 


7.634 


39 


ALA14 


0 


-23.058 


-2.719 


7.372 


40 


ALA14 


CB 


-25,311 


-0.764 


7.075 


41 


ARG15 


N 


-24.628 


-3.792 


8.569 


42 


ARG15 


CA 


-23.664 


-4.537 


9.379 


43 


ARG15 


C 


-23.478 


-5.983 


8.91 


44 


ARG15 


0 


-22.815 


-6.767 


9.599 


45 


ARG15 


CB 


-24.174 


-4.519 


10.81 


46 


ARG15 


CG 


-25.655 


-4.879 


10.84 


47 


ARG15 


CD 


-26.2 


-4.843 


12.26 


48 


ARG15 


NE 


-27.657 


-5.039 


12.256 


49 


ARG15 


cz 


-28.358 


-5.301 


13.36 


50 


ARG15 


NH1 


-29.69 


-5.376 


13.3 


51 


ARG15 


NH2 


-27.735 


-5.412 


14.536 


52 


LYS16 


N 


-24.096 


-6.351 


7.798 
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53 


LYS16 


CA 


-24.016 


-7.741 


7.335 


54 


LYS16 


c 


-22.639 


-8.128 


6.807 


55 


LYS16 


0 


-21.959 


-7,359 


6.115 


56 


LYS16 


CB 


-25.061 


-7,977 


6.252 


57 


LYS16 


CG 


-26.466 


-7,985 


6.839 


58 


LYS16 


CD 


-26.605 


-9,079 


7.892 


59 


LYS16 


CE 


-28.002 


-9.092 


8.499 


60 


LYS16 


NZ 


-28.113 


-10.128 


9.537 


61 


CYS17 


N 


-22.317 


-9,392 


7.036 


62 


CYS17 


CA 


-21.061 


-10.004 


6.56 


63 


CYS17 


c 


-20.737 


-9.771 


5.066 


64 


CYS17 


0 




-9.205 




65 


CYS17 


CB 




-11 sni 


A ftR4 


66 


CYS17 






-11 


ft finp 


67 


PR018 


N 




-1 n CiCi'\ 


A 1 


68 


PR018 


CA 




-9.756 




69 


PR018 


Q 


-P1 IP^^ 


-ft PQ1 


P P4fi 


70 


PR018 


0 


-P1 m*^ 


-ft nfii 




71 


PR018 


CB 


-PP '^Aft 


-in '^ft'^ 


1 ft7ft 
1 .0/ 0 


72 


PR018 


CG 




-in flip 


p ftnp 


73 


PR018 


rn 




-in 


4 pn7 


74 


PHE19 


N 


-P1 1*^7 

^ 1 . 1 o/ 


-7 *V\ 


1RP 


75 


PHF1Q 


HA 


_on 7QO 
-^u. / at 


- j.a*T/ 


P ft'^4 


76 


PHF1Q 


c 


-10 P7Q 


-S 777 


P 7ftft 


77 




n 


7ftQ 
1 o. /O^ 






78 


PHE19 


r:R 


-P1 ^ft 


-S nnv 


ftOA 


79 


PHE19 


CG 


-PP ft 






80 


PHF1Q 




-^O.UO 1 


-'^ P7 


T P'^P 


81 


PHE19 






-S 44.4 


^ ftR7 


82 


PHE19 


nFi 


-PA '^'^'^ 


-P ftS*^ 




83 


PHE19 


CFP 


-PS 1'^Q 


-S n*^ 


fiPQ 


84 


PHE19 






■O. f oo 


107 


85 


SER20 


N 


-1ft '^7'^ 


-fi ftft7 


T 440 


86 




PA 


-17 inp 


-fi 717 


44R 


87 


O L_l \^\J 






-7 ft'^Q 


4 '^4P 


88 


SER20 




-1R fi'^P 


-7 7P'^ 




89 


SER20 


CB 


-1fi ^^^7 


-S '^71 


QPQ 


90 


SER20 


00 


-17.236 


-5.019 


5.129 


91 


PR021 


N 


-IS Q74 


-ft ftfi7 




92 


PR021 


CA 


-15.978 


-9.134 


2.304 


93 


PR021 


c 


-17.267 


-9-836 


1.856 


94 


PR021 


0 


-18.026 


-10.327 


2.702 


95 


PR021 


CB 


-14.8 


-10.047 


2.111 


96 


PR021 


CG 


-14.442 


-10.669 


3.455 


97 


PR021 


CD 


-15.306 


-9.949 


4.481 


98 


PR022 


N 


-17.551 


-9,859 


0.561 


99 


PR022 


CA 


-16.897 


-9.007 


-0.445 


100 


PR022 


C 


-17.4 


-7.575 


-0.296 


101 


PR022 


0 


-18.341 


-7.371 


0.469 


102 


PR022 


CB 


-17.32 


-9.591 


-1.762 


103 


PR022 


CG 


-18.478 


-10.549 


-1.528 


104 


PR022 


CD 


-18.669 


-10,604 


-0.021 


105 


PR023 


N 


-16.687 


-6.605 


-0.842 


106 


PR023 


CA 


-17.224 


-5.241 


-0.897 
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O 



o 



107 


PR023 


C 


-18.525 


-5.21 


-1-693 


108 


PR023 


0 


-18.524 


-5.083 


-2.925 


109 


PR023 


CB 


-16.159 


-4,417 


-1.547 


110 


PR023 


CG 


-15.004 


-5.321 


-1,95 


111 


PR023 


CD 


-15.388 


-6.725 


-1.509 


112 


GLU24 


N 


-19.62 


-5.122 


-0.956 


113 


GLU24 


CA 


-20.963 


-5.192 


-1.547 


114 


GLU24 


C 


-21 .41 5 


-3,843 


-2.088 


115 


GLU24 


0 


-22,323 


-3.794 


-2.93 


116 


GLU24 


CB 


-21.934 


-5.68 


-0.48 


117 


GLU24 


CG 


-23.27 


-6.137 


-1,052 


118 


GLU24 


CD 


-23.982 


-7.017 


-0.024 


119 


GLU24 


0E1 


-24.613 


-7.981 


-0.433 


120 


GLU24 


0E2 


-23.833 


-6.745 


1.158 


121 


TYR25 


N 


-20.573 


-2.843 


-1.878 


122 


TYR25 


CA 


-20,842 


-1.47 


-2.303 


123 


TYR25 


c 


-20.704 


-1.311 


-3.816 


124 


TYR25 


0 


-21,364 


-0.436 


-4.385 


125 


TYR25 


CB 


-19.828 


-0.568 


-1.608 


126 


TYR25 


CG 


-19.616 


-0.882 


-0.128 


127 


TYR25 


CD1 


-20.662 


-0,753 


0.779 


IPft 


TYR2S 

1 T r\^\j 


CD2 




-1.298 


0.31 1 


1 ^9 


TYR25 


CF1 


-?0 461 


-1.062 


2.119 


130 


TYR25 


CP2 


-18 163 


-1.605 


1.65 


1 O 1 


1 1 n£\j 


C7 

V/*- 




-1 .492 


2.55 


1*^2 


TYR25 


OH 


-1Q 026 


-1.859 


3.866 


1 '^'^ 

1 Ox^ 




N 




-2.296 


-4.468 


1*^4 

1 


GLU26 


CA 




-2.293 


-5.928 


135 


GLU26 


c 


-21 .404 


-2.483 


-6.52 


136 


GLU26 


o 


-21 .92 


-1.572 


-7.177 


137 


GLU26 


CB 


-19.129 


-3.454 


-6.39 


138 


GLU26 


CG 


-17.813 


-3.593 


-5.628 


139 


GLU26 


CD 


-16.94 


-2.342 


-5.707 


140 


GLU26 


0E1 


-16.345 


-2.12 


-6.749 


141 


GLU26 


0E2 


-16.773 


-1.731 


-4.657 


142 


ARG27 


IS] 


-22.105 


-3.488 


-6.017 


143 


ARG27 


CA 


-23.437 


-3.805 


-6.538 


144 


ARG27 


c 


-24.504 


-2.909 


-5.921 


145 


ARG27 


0 


-25.496 


-2.591 


-6.59 


146 


ARG27 


CB 


-23.752 


-5.26 


-6.22 


147 


ARG27 


CG 


-22.7 


-6.189 


-6.812 


148 


ARG27 


CD 


-23.031 


-7.653 


-6.55 


149 


ARG27 


NE 


-23.146 


-7.926 


-5.108 


150 


ARG27 


CZ 


-22.251 


-8.648 


-4.428 


151 


ARG27 


NH1 


-21.16 


-9.11 


-5.043 


152 


ARG27 


NH2 


-22.428 


-8.879 


-3.126 


153 


LEU28 


N 


-24.197 


-2.331 


-4.771 


154 


LEU28 


CA 


-25.11 


-1.358 


-4.168 


155 


LEU28 


C 


-25.131 


-0.079 


-4.987 


156 


LEU28 


0 


-26.214 


0.286 


-5.45 


157 


LEU28 


CB 


-24.67 


-1.039 


-2.746 


158 


LEU28 


CG 


-24.868 


-2.224 


-1.81 


159 


LEU28 


CD1 


-24.303 


-1.916 


-0.43 


160 


LEU28 


CD2 


-26,34 


-2.609 


-1.716 
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161 


ARG29 


N 


-23.969 


0.307 


-5.49 


162 


ARG29 


CA 


-23.835 


1.502 


-6.327 


163 


ARG29 


c 


-24.521 


1.334 


-7.677 


164 


ARG29 


0 


-25.271 


2.226 


-8.096 


165 


ARG29 


CB 


-22.345 


1.682 


-6.568 


166 


ARG29 


CO 


-21 .997 


2.947 


-7.336 


167 


ARG29 


CD 


-20.519 


2.941 


-7.711 


168 
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1836 


ILE241 


C 


-6.509 


8.091 


17.184 


1837 


ILE241 


0 


-6.406 


6.98 


16.64 


1838 


ILE241 


CB 


-8.544 


8.391 


18.599 


1839 


ILE241 


CG1 


-9.122 


8.403 


20.008 


1840 


ILE241 


CG2 


-9.233 


7.311 


17.771 


1841 


ILE241 


CD1 


-8.939 


7.06 


20.699 


1842 


ALA242 


N 


-5.959 


9.182 


16.676 


1843 


ALA242 


CA 


-5.353 


9.154 


15.345 


1844 


ALA242 


C 


-4.049 


8.363 


15.361 


1845 


ALA242 


0 


-3.842 


7.511 


14.487 


1846 


ALA242 


CB 


-5.09 


10.583 


14.884 


1847 


GLY243 


N 


-3.34 


8.426 


16.473 


1848 


GLY243 


CA 


-2.135 


7.612 


16.614 


1849 


GLY243 


C 


-2.461 


6.175 


17.012 


1850 


GLY243 


0 


-2.095 


5.238 


16.289 


1851 


HIS244 


N 


-3.274 


6.021 


18.047 


1852 


HIS244 


CA 


-3.556 


4.687 


18.603 


1853 


HIS244 


C 


-4.424 


3.78 


17.729 


1854 


HIS244 


0 


-4.288 


2.555 


17.808 


1855 


HIS244 


CB 


-4.271 


4.846 


19.946 


1856 


HIS244 


CG 


-3.394 


5.206 


21.132 


1857 


HIS244 


ND1 


-2.995 


6.436 


21.502 


1858 


HIS244 


CD2 


-2.87 


4.327 


22.051 


1859 


HIS244 


CE1 


-2.232 


6.348 


22.61 


1860 


HIS244 


NE2 


-2.156 


5.042 


22.95 


1861 


GLU245 


N 


-5.298 


4.33 


16.905 


1862 


GLU245 


CA 


-6.157 


3.454 


16.101 


1863 


GLU245 


c 


-6.041 


3.723 


14.61 


1864 


GLU245 


0 


-5.883 


2.788 


13.812 


1865 


GLU245 


CB 


-7.616 


3.66 


16.504 


1866 


GLU245 


CG 


-7.895 


3.313 


17.963 


1867 


GLU245 


CD 


-7.613 


1.839 


18.255 


1868 


GLU245 


0E1 


-7.721 


1.028 


17.348 


1869 


GLU245 


0E2 


-7.228 


1.562 


19.381 


1870 


THR246 


N 


-5.987 


4.996 


14.26 


1871 


THR246 


CA 


-6.101 


5.374 


12.849 


1872 


THR246 


C 


-4.863 


4.962 


12.066 


1873 


THR246 


0 


-4.949 


4.04 


11.244 


1874 


THR246 


CB 


-6.314 


6.88 


12.759 


1875 


THR246 


0G1 


-7.415 


7.225 


13.59 


1876 


THR246 


CG2 


-6.614 


7.344 


11.338 


1877 


THR247 


N 


-3.702 


5.384 


12.532 


1878 


THR247 


CA 


-2.48 


5.072 


11.792 


1879 


THR247 


C 


-2.013 


3.634 


12.044 


1880 


THR247 


0 


-1.437 


3.032 


11.132 


1881 


THR247 


CB 


-1.4 


6.078 


12.171 


1882 


THR247 


0G1 


-1.937 


7.39 


12.062 


1883 


THR247 


CG2 


■0.192 


5.981 


11.246 


1884 


ALA248 


N 


-2.533 


3.007 


13.09 


1885 


ALA248 


CA 


-2.225 


1.596 


13.35 


1886 


ALA248 


C 


-2.915 


0.676 


12.342 


1887 


ALA248 


O 


-2.236 


-0.089 


11.643 


1888 


ALA248 


CB 


-2.678 


1.25 


14.763 
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1889 


ASN249 


N 


-4.175 


0.958 


12.049 


1890 


ASN249 


CA 


-4.878 


0.157 


11.042 


1891 


ASN249 


C 


-4.453 


0.517 


9.624 


1892 


ASN249 


0 


-4.313 


-0.39 


8.791 


1893 


ASN249 


CB 


-6.377 


0.361 


11.199 


1894 


ASN249 


CG 


-6.902 


-0.561 


12.291 


1895 


ASN249 


0D1 


-6.261 


-1.569 


12.613 


1896 


ASN249 


ND2 


-8.133 


-0.32 


12.703 


1897 


MET250 


N 


-3.945 


1.727 


9.463 


1898 


MET250 


CA 


-3.472 


2.183 


8.159 


1899 


MET250 


C 


-2.137 


1.529 


7.787 


1900 


MET250 


0 


-2.018 


1.009 


6.669 


1901 


MET250 


CB 


-3.231 


3.702 


8.255 


1902 


MET250 


CG 


-3.343 


4.33 


6.851 


1903 


MET250 


SD 


-2.234 


5.768 


6.665 


1904 


MET250 


CE 


-1.954 


6.241 


8.406 


1905 


ILE251 


N 


-1.267 


1.317 


8.764 


1906 


ILE251 


CA 


0.019 


0.667 


8.477 


1907 


ILE251 


C 


-0.132 


-0.844 


8.328 


1908 


ILE251 


0 


0.449 


-1.427 


7.402 


1909 


ILE251 


CB 


1.001 


0.949 


9.613 


1910 


ILE251 


CGI 


1.293 


2.434 


9.757 


1911 


ILE251 


CG2 


2.305 


0.188 


9.404 


1912 


ILE251 


CD1 


2.264 


2.698 


10.902 


1913 


SER252 


N 


-1.081 


-1.421 


9.047 


1914 


SER252 


CA 


-1.254 


-2.875 


8.973 


1915 


SER252 


c 


-1.947 


-3.301 


7.679 


1916 


SER252 


0 


-1.444 


-4.204 


6,998 


1917 


SER252 


CB 


-2.03 


-3.36 


10.197 


1918 


SER252 


OG 


-3.28 


-2.684 


10.279 


1919 


LEU253 


N 


-2.85 


-2.465 


7.194 


1920 


LEU253 


CA 


-3.558 


-2.762 


5.948 


1921 


LEU253 


C 


-2.699 


-2.386 


4.739 


1922 


LEU253 


O 


-2.665 


-3.129 


3.748 


1923 


LEU253 


CB 


-4.857 


-1.963 


5.994 


1924 


LEU253 


CG 


-5.781 


-2.185 


4.804 


1925 


LEU253 


CD1 


-6.009 


-3.665 


4.521 


1926 


LEU253 


CD2 


-7.109 


-1.479 


5.058 


1927 


GLY254 


N 


-1.818 


-1.42 


4.948 


1928 


GLY254 


CA 


-0.841 


-1.031 


3.933 


1929 


GLY254 


C 


0.161 


-2.148 


3.671 


1930 


GLY254 


0 


0.267 


-2.62 


2.531 


1931 


THR255 


N 


0.707 


-2.706 


4.742 


1932 


THR255 


CA 


1.711 


-3.772 


4.62 


1933 


THR255 


C 


1.123 


-5.042 


4.017 


1934 


THR255 


0 


1.715 


-5.592 


3.079 


1935 


THR255 


CB 


2.255 


-4.105 


6.007 


1936 


THR255 


0G1 


2.837 


-2.935 


6.563 


1937 


THR255 


CG2 


3.334 


-5.18 


5.938 


1938 


VAL256 


N 


-0.133 


-5.321 


4.331 


1939 


VAL256 


CA 


-0.797 


-6.505 


3.781 


1940 


VAL256 


C 


-1.11 


-6.37 


2.291 


1941 


VAL256 


0 


-0.832 


-7.307 


1.531 


1942 


VAL256 


CB 


-2.083 


-6.706 


4.571 
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1943 


VAL256 


CGI 


-3.028 


-7.693 


3.905 


1944 


VAL256 


CG2 


-1 .774 


-7.144 


5.993 


1945 


THR257 


N 


-1.372 


-5.158 


1.833 


1946 


THR257 


CA 


-1 .675 


-4.982 


0.413 


1947 


THR257 


C 


-0.401 


-4.902 


-0.427 


1948 


THR257 


0 


-0.357 


-5.499 


-1.512 


1949 


THR257 


CB 


-2.502 


-3.717 


0.248 


1950 


THR257 


0G1 


-3.63 


-3.818 


1.106 


1951 


THR257 


CG2 


-3.002 


-3.558 


-1.183 


1952 


LEU258 


N 


0.69 


-4.465 


0.185 


1953 


LEU258 


CA 


1.976 


-4.434 


-0.526 


1954 


LEU258 


C 


2.588 


-5.828 


-0.61 


1955 


LEU258 


0 


3.147 


-6.205 


-1.648 


1956 


LEU258 


CB 


2.936 


-3.524 


0.233 


1957 


LEU258 


CG 


2.45 


-2.08 


0.263 


1958 


LEU258 


CD1 


3.329 


-1.229 


1.172 


1959 


LEU258 


CD2 


2.391 


-1 .492 


-1,141 


1960 


LEU259 


N 


2.248 


-6.652 


0.368 


1961 


LEU259 


CA 


2.721 


-8.034 


0.407 


1962 


LEU259 


C 


1.818 


-8.965 


-0.407 


1963 


LEU259 


0 


2.223 


-10.083 


-0.747 


1964 


LEU259 


CB 


2.742 


-8.462 


1.869 


1965 


LEU259 


CG 


3.979 


-9.291 


2.183 


1966 


LEU259 


CD1 


5.235 


-8.559 


1.727 


1967 


LEU259 


CD2 


4.05 


-9.61 


3.672 


1968 


GLU260 


N 


0.649 


-8.473 


-0.791 


1969 


GLU260 


CA 


-0.221 


-9.215 


-1.707 


1970 


GLU260 


C 


0.163 


-8.932 


-3.151 


1971 


GLU260 


0 


0.033 


-9.804 


-4.019 


1972 


GLU260 


CB 


-1.67 


-8.773 


-1.531 


1973 


GLU260 


CG 


-2.53 


-9.85 


-0.883 


1974 


GLU260 


CD 


-2.442 


-9.78 


0.637 


1975 


GLU260 


0E1 


-1.632 


-10.498 


1.204 


1976 


GLU260 


0E2 


-3.287 


-9.103 


1.202 


1977 


ASN261 


N 


0.693 


-7.743 


-3.382 


1978 


ASN261 


CA 


1.158 


-7.381 


-4.723 


1979 


ASN261 


C 


2.669 


-7.181 


-4.742 


1980 


ASN261 


0 


3.134 


-6.032 


-4.804 


1981 


ASN261 


CB 


0.468 


-6.089 


-5.161 


1982 


ASN261 


CG 


-0.986 


-6.311 


-5.586 


1983 


ASN261 


0D1 


-1.784 


-6.942 


-4.881 


1984 


ASN261 


ND2 


-1.339 


-5.701 


-6.705 


1985 


PR0262 


N 


3.404 


-8.259 


-4.986 


1986 


PR0262 


CA 


4.864 


-8.219 


-4.846 


1987 


PR0262 


C 


5.563 


-7.432 


-5.958 


1988 


PR0262 


0 


6.612 


-6.834 


-5.706 


1989 


PR0262 


CB 


5.298 


-9.652 


-4.871 


1990 


PR0262 


CG 


4.108 


-10.533 


-5.223 


1991 


PR0262 


CD 


2.912 


-9.601 


-5.319 


1992 


ASP263 


N 


4.884 


-7.222 


-7.077 


1993 


ASP263 


CA 


5.442 


-6.411 


-8.166 


1994 


ASP263 


C 


5.27 


-4.909 


-7.918 


1995 


ASP263 


0 


6.133 


-4.124 


-8.327 


1996 


ASP263 


CB 


4.783 


-6.822 


-9.488 
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1997 


ASP263 


CG 


3.253 


-6.778 


-9.42 


1998 


ASP263 


0D1 


2.69 


-5.734 


-9.721 


1999 


ASP263 


0D2 


2.673 


-7.766 


-8.992 


2000 


GLN264 


N 


4.358 


-4.558 


-7.024 


2001 


GLN264 


CA 


4.149 


-3.154 


-6.675 


2002 


GLN264 


C 


5.104 


-2.803 


-5.545 


2003 


GLN264 


0 


5.782 


-1.769 


-5.59 


2004 


GLN264 


CB 


2.709 


-3.002 


-6.206 


2005 


GLN264 


CG 


1.723 


-3.483 


-7.265 


2006 


GLN264 


CD 


1.637 


-2.485 


-8.412 


2007 


GLN264 


0E1 


1.631 


-1.274 


-8.178 


2008 


GLN264 


NE2 


1.592 


-2.991 


-9.63 


2009 


LEU265 


N 


5.398 


-3.825 


-4.758 


2010 


LEU265 


CA 


6.389 


-3.701 


-3.693 


2011 


LEU265 


C 


7.808 


-3.677 


-4.263 


2012 


LEU265 


0 


8.639 


-2.888 


-3.798 


2013 


LEU265 


CB 


6.218 


-4.903 


-2.775 


2014 


LEU265 


CG 


7.134 


-4.809 


-1.565 


2015 


LEU265 


CD1 


6.869 


-3.518 


-0.802 


2016 


LEU265 


CD2 


6.958 


-6.023 


-0.662 


2017 


ALA266 


N 


7,993 


-4.309 


-5.41 1 


2018 


ALA266 


CA 


9.286 


-4.267 


-6.093 


2019 


ALA266 


C 


9.528 


-2.929 


-6.785 


2020 


ALA266 


0 


10.66 


-2.434 


-6.737 


2021 


ALA266 


CB 


9.333 


-5.39 


-7.123 


2022 


LYS267 


N 


8.466 


-2.235 


-7.167 


2023 


LYS267 


CA 


8.641 


-0.891 


-7.725 


2024 


LYS267 


C 


8.887 


0.136 


-6.625 


2025 


LYS267 


0 


9.706 


1.042 


-6.818 


2026 


LYS267 


CB 


7.406 


-0.501 


-8.523 


2027 


LYS267 


CG 


7.223 


-1.394 


-9.742 


2028 


LYS267 


CD 


6.072 


-0.894 


-10.604 


2029 


LYS267 


CE 


4.779 


-0.825 


-9.803 


2030 


LYS267 


NZ 


3.688 


-0.267 


-10.615 


2031 


ILE268 


N 


8.413 


-0.162 


-5.427 


2032 


ILE268 


CA 


8.717 


0.664 


-4.254 


2033 


ILE268 


C 


10.181 


0.531 


-3.84 


2034 


ILE268 


0 


10.879 


1.544 


-3.697 


2035 


ILE268 


CB 


7.827 


0.171 


-3.117 


2036 


ILE268 


CGI 


6.38 


0.576 


-3.335 


2037 


ILE268 


CG2 


8.311 


0.638 


-1.75 


2038 


ILE268 


CD1 


5.504 


0.057 


-2.205 


2039 


LYS269 


N 


10.693 


-0.688 


-3.902 


2040 


LYS269 


CA 


12.068 


-0.95 


-3.463 


2041 


LYS269 


C 


13.116 


-0.675 


-4.542 


2042 


LYS269 


0 


14.306 


-0.554 


-4.228 


2043 


LYS269 


CB 


12.126 


-2.405 


-3.021 


2044 


LYS269 


CG 


11.167 


-2.628 


-1.858 


2045 


LYS269 


CD 


10.997 


-4.107 


-1.542 


2046 


LYS269 


CE 


12.315 


-4.756 


-1.148 


2047 


LYS269 


NZ 


12.106 


-6.181 


-0.856 


2048 


ALA270 


N 


12.679 


-0.53 


-5.782 


2049 


ALA270 


CA 


13.585 


-0.106 


-6.851 


2050 


ALA270 


C 


13.478 


1.396 


-7.101 
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2051 


ALA270 


0 


14.286 


1.974 


-7.838 


2052 


ALA270 


CB 


13.233 


-0.863 


-8.125 


2053 


ASP271 


N 


12.486 


2.017 


-6.486 


2054 


ASP271 


CA 


12.271 


3.453 


-6.649 


2055 


ASP271 


C 


11.505 


4.006 


-5.448 


2056 


ASP271 


0 


10.267 


4.02 


-5.445 


2057 


ASP271 


CB 


11.48 


3.653 


-7.944 


2058 


ASP271 


CG 


11.354 


5.125 


-8.337 


2059 


ASP271 


0D1 


10.975 


5.919 


-7.482 


2060 


ASP271 


0D2 


11.493 


5.405 


-9.517 


2061 


PR0272 


N 


12.238 


4.705 


-4.592 


2062 


PR0272 


CA 


11.686 


5.248 


-3.337 


2063 


PR0272 


C 


10.72 


6.438 


-3.497 


2064 


PR0272 


0 


9.973 


6.74 


-2.558 


2065 


PR0272 


CB 


12.887 


5.657 


-2.54 


2066 


PR0272 


CG 


14.133 


5.544 


-3.406 


2067 


PR0272 


CD 


13.673 


4.968 


-4.734 


2068 


GLY273 


N 


10.586 


6,971 


-4.702 


2069 


GLY273 


CA 


9.607 


8.03 


-4.968 


2070 


GLY273 


c 


8.207 


7.423 


-5.037 


2071 


GLY273 


0 


7.257 


7.968 


-4.456 


2072 


LYS274 


N 


8.167 


6.171 


-5.473 


2073 


LYS274 


CA 


6.916 


5.418 


-5.566 


2074 


LYS274 


c 


6.39 


4.954 


-4.21 


2075 


LYS274 


o 


5.225 


4.55 


-4.144 


2076 


LYS274 


CB 


7.138 


4.184 


-6.431 


2077 


LYS274 


CG 


7.528 


4.547 


-7.856 


2078 


LYS274 


CD 


7.755 


3.287 


-8,681 


2079 


LYS274 


CE 


8.177 


3,615 


-10.107 


2080 


LYS274 


NZ 


8.465 


2.383 


-10.858 


2081 


THR275 


N 


7.12 


5.182 


-3.127 


2082 


THR275 


CA 


6.593 


4.814 


-1.813 


2083 


THR275 


C 


5.522 


5.804 


-1.352 


2084 


THR275 


0 


4.516 


5.354 


-0,798 


2085 


THR275 


CB 


7.725 


4.76 


-0.789 


2086 


THR275 


0G1 


8.169 


6.074 


-0.485 


2087 


THR275 


CG2 


8.912 


3,963 


-1.305 


2088 


LEU276 


N 


5.564 


7.036 


-1.844 


2089 


LEU276 


CA 


4.543 


8.012 


■1.446 


2090 


LEU276 


C 


3.312 


7.905 


-2.346 


2091 


LEU276 


0 


2.175 


8,051 


-1.876 


2092 


LEU276 


CB 


5.14 


9.411 


-1.538 


2093 


LEU276 


CG 


4.182 


10.462 


-0.987 


2094 


LEU276 


CD1 


3.836 


10.177 


0.472 


2095 


LEU276 


CD2 


4.77 


11.861 


-1.133 


2096 


ALA277 


N 


3.53 


7.375 


-3.539 


2097 


ALA277 


CA 


2.417 


7.126 


-4.451 


2098 


ALA277 


C 


1.711 


5.836 


-4.052 


2099 


ALA277 


0 


0.475 


5.796 


-4.026 


2100 


ALA277 


CB 


2.963 


7.021 


-5.869 


2101 


ALA278 


N 


2.472 


4.947 


-3.431 


2102 


ALA278 


CA 


1.909 


3.727 


-2.859 


2103 


ALA278 


C 


1.12 


4.01 


-1.591 


2104 


ALA278 


0 


0.051 


3,416 


-1.427 
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2105 


ALA278 


CB 


3.048 


2.773 


-2.523 


2106 


ILE279 


N 


1.472 


5.064 


-0.867 


2107 


ILE279 


CA 


0.698 


5.447 


0.324 


2108 


ILE279 


C 


-0.692 
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-15.995 


5 


2921 


GLU385 


CA 


-6.994 


-16.177 


6.448 


2922 


GLU385 


c 


-6.674 


-14.884 


7,194 


2923 


GLU385 


0 


-6.638 


-14.896 


8.429 


2924 


GHJ385 


CB 


-6.01 


-17.259 


6.874 


2925 


GLU385 


CG 


-6.219 


-18.561 


6.111 


2926 


GLU385 


CD 


-7.651 


-19.079 


6.248 


2927 


GLU385 


0E1 


-8.017 


-19.462 


7.349 


2928 


GLU385 


GE2 


-8.266 


-19.256 


5.205 


2929 


LEU386 


N 


-6.406 


-13.81 


6.463 


2930 


LEU386 


CA 


-6.082 


-12.519 


7.093 


2931 


LEU386 


c 


-7.266 


-1 1 .953 


7.874 


2932 


LEU386 


0 


-8.342 


-1 1 .71 


7.315 


2933 


LEU386 

Lb L_ V./ V.J U U 


CB 


-5.676 


-11.542 


5.996 


2934 


LELI386 


CG 


-4.348 


-1 1 .943 


5.365 


2935 


LFLI386 


CD1 


*T.VO 1 


* 1 1 . 1 oo 


4 OQI 


2936 




HDP 




-11 ll'X 
1 1 . / t o 




2937 




N 




-11 7QR 
II./ 


Q 17*^ 


2938 




HA 


-ft 1*^? 


-1 1 '^Cl 


ID noi 

1 \J.\JiS 1 






c. 






in n47 


2940 




o 




-Q HQS 




2941 


PRO'^87 


CB 


-7 R47 


-1 1 .801 


1 1 .445 


2942 


PRO'^R7 




-fi 1Q1 


-IP PP4 




2943 


PRn'^R7 


CD 


-'^ fl17 


-1P irm 

1 1 u*^ 




2944 


111 ^ooo 


N 


-Q '^14 




9.143 


2945 


PHF38A 


CA 


-Q 77^ 


-8.145 


Q 01P 


2946 


PHF388 


Q 


-10.688 


-7.79 


10.176 


2947 


PHE388 


0 


-1 1.522 


-8.597 


10.603 


2948 


PHF'Via 


CB 


-10.558 


-7.999 


7.709 


2949 


PHE388 


CG 


-9.785 


-8.343 


6.437 


2950 


PHE388 


CD1 


-8.78 


-7.498 


5.987 


2951 


PHE388 


CD2 


-10.097 


-9.492 


5.721 


2952 


PHE388 

1 1 1*1* www 


CE1 


-8.076 


-7.809 


4.831 


2953 


PHE388 


CE2 


-9.393 


-9.804 


4.565 


2954 


PHE388 


CZ 


-8.381 


-8.963 


4.121 


2955 


LYS389 


N 


-10.5 


-6.599 


10.707 


2956 


LYS389 


CA 


-11.364 


-6.141 


1 1 .792 


2957 


LYS389 


C 


-12.626 


-5.52 


1 1 .203 


2958 


LYS389 


0 


-12.542 


-4.581 


10.4 


2959 


LYS389 


CB 


-10.611 


-5.115 


12.633 


2960 


LYS389 


CG 


-11.439 


-4.708 


13.847 


2961 


LYS389 


CD 


-10.677 


-3.763 


14.767 


2962 


LYS389 


CE 


-11.487 


-3.466 


16.023 


2963 


LYS389 


NZ 


-10.719 


-2.637 


16.96 


2964 


HIS390 


N 


-13.775 


-6.068 


11.571 


2965 


HIS390 


CA 


-15.055 


-5.523 


11.102 


2966 


HIS390 


C 


-15.385 


-4.226 


11.836 


2967 


HIS390 


0 


-15.845 


-4.213 


12.983 


2968 


HIS390 


CB 


-16.162 


-6.548 


11.316 
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2969 


HIS390 


CG 


-17.525 


-6.094 


10.826 


2970 


HIS390 


ND1 


-17.895 


-5.893 


9.545 


2971 


HIS390 


CD2 


-18.62 


-5.81 


11.607 


2972 


HIS390 


CE1 


-19.181 


-5.487 


9.511 


2973 


HIS390 


NE2 


-19.629 


-5.437 


10.786 


2974 


ASP391 


N 


-15.053 


-3.138 


11.167 


2975 


ASP391 


CA 


-15.269 


-1 .789 


11.683 


2976 


ASP391 


C 


-15.392 


-0.871 


10.48 


2977 


ASP391 


0 


-14.395 


-0.539 


9.835 


2978 


ASP391 


CB 


-14.068 


-1.414 


12.553 


2979 


ASP391 


CG 


-14.172 


-0.02 


13.18 


2980 


ASP391 


0D1 


-14.984 


0.771 


12.707 


2981 


ASP391 


0D2 


-13.241 


0.324 


13.884 


2982 


SER392 


N 


-16.582 


-0.335 


10.283 


2983 


SER392 


CA 


-16.835 


0.448 


9.075 


2984 


SER392 


c 


-16.261 


1.863 


9.129 


2985 


SER392 


0 


-15.99 


2.431 


8.07 


2986 


SER392 


CB 


-18.342 


0.537 


8,868 


2987 


SER392 


OG 


-18.876 


1.383 


9.878 


2988 


THR393 


N 


-15.944 


2.392 


10.297 


2989 


THR393 


CA 


-15.451 


3.768 


10.313 


2990 


THR393 


c 


-13.931 


3.822 


10.468 


2991 


THR393 


0 


-13.294 


4.722 


9.909 


2992 


THR393 


CB 


-16.183 


4.53 


1 1.41 1 


2993 


THR393 

II II IW V V 


0G1 


-17.557 


4.561 


11.04 


2994 


THR393 

II 11 1 WW W 


CG2 


-15-703 


5.972 


11.528 


2995 


ILE394 


N 


-13.365 


2.816 


11.118 


2996 


ILE394 


CA 


-1 1.899 


2.665 


11.203 


2997 


ILE394 


c 


-11.525 


1.204 


10.92 


2998 


ILE394 


0 


-1 1 .205 


0.419 


11.824 


2999 


ILE394 


CB 


-11.37 


3.095 


12.577 


3000 


ILE394 


CG1 


-11,744 


4.533 


12.921 


3001 


ILE394 


CG2 


-9.847 


2.978 


12.624 


3002 


ILE394 


CD1 


-10.977 


5.529 


12.055 


3003 


TYR395 


N 


-11.59 


0.854 


9.649 


3004 


TYR395 


CA 


-11.29 


-0.503 


9.179 


3005 


TYR395 


c 


-9.792 


-0.786 


9.302 


3006 


TYR395 


0 


-8.997 


0.149 


9.447 


3007 


TYR395 


CB 


-11.747 


-0.572 


7.721 


3008 


TYR395 


CG 


-11.784 


-1 .967 


7.101 


3009 


TYR395 


CD1 


-10.958 


-2.272 


6.026 


3010 


TYR395 


CD2 


-12.648 


-2.927 


7.612 


3011 


TYR395 


CE1 


-10.991 


-3.543 


5.465 


3012 


TYR395 


CE2 


-12.682 


-4.199 


7.052 


3013 


TYR395 


CZ 


-11.852 


-4.502 


5-982 


3014 


TYR395 


OH 


-11.882 


-5.763 


5.427 


3015 


GLY396 


N 


-9.433 


-2.053 


9.421 


3016 


GLY396 


CA 


-8,007 


-2.401 


9.468 


3017 


GLY396 


C 


-7.732 


-3.871 


9.759 


3018 


GLY396 


0 


■8.601 


-4.74 


9.609 


3019 


LEU397 


N 


-6.493 


-4.132 


10.134 


3020 


LEU397 


CA 


-6.031 


-5.497 


10.406 


3021 


LEU397 


C 


-5.334 


-5.577 


11.751 


3022 


LEU397 


0 


-4.297 


-4.938 


11.961 
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3023 


LEU397 


CB 


-5.051 


-5.894 


9.311 


3024 


LEU397 


CG 


-5.773 


-6.502 


8.12 


3025 


LEU397 


CD1 


-5.037 


-6.225 


6.822 


3026 


LEU397 


CD2 


-5.979 


-7.996 


8.325 


3027 


HIS398 


N 


-5.87 


-6.402 


12.634 


3028 


HIS398 


CA 


-5.274 


-6.514 


13.967 


3029 


HIS398 


c 


-4.348 


-7.718 


14.107 


3030 


HIS398 


0 


-3.651 


-7.848 


15.12 


3031 


HIS398 


CB 


-6.363 


-6.528 


15.033 


3032 




CG 


-6.737 


■5.14 


15.525 


3033 




ND1 


-7.052 


-4.804 


16.79 


3034 




CD2 


-6.795 


-3.984 


14.781 


3035 




CE1 


-7.31 1 


-3.482 


16.851 








-7 


-2.975 


15.607 






N 




-8.567 


13.094 




Al A'^QQ 


PA 




-9.671 


13.12 




Al A'^QQ 


c 






1 1 .756 


\jKJH\J 


Al A'^QQ 


n 




-10.014 


10.724 


0\JH 1 


Al A'^QQ 


PR 




-10 Q'^R 


1 3.633 




1 Fl lACiCi 


M 


-1 '5ft'5 


-10 004 


1 1.794 


OV/HO 




PA 


-0 '^R7 


-10 P04 


10.589 




1 PI lAOO 




0 77P 


-10 ft'^R 


10.935 


OU*fO 


1 PI i^nn 


\j 


1 71 9 
1 ./ 1 ^ 


-10 Ifi*^ 


1 1 




1 PI lAnn 


PR 


-0 ^507 


-ft ft4^ 


Q Q4R 

J7.57HD 




I PI lAnn 


Pr^ 


0 R1 

U.D 1 O 




fl 7'5R 




1 PI I4nn 


pni 


0 OOR 




7 R44 




1 PI Mon 


PRO 


0 07*5 


-7 '%79 


fl 1Q7 


OUDU 




M 
IN 


0 ft1 *\ 


- 1 ^. 1 / o 


in ftfli 




ppnAoi 


PA 


9 Oft4 


-1 9 Q 


in Q74 




PPOAHI 


p 






Q R71 




PROAOI 
r nWrv 1 




9 4,1 '5 


1 O.^H^ 


R fiOP 




PROdni 


PR 


1 RRR 


-14 "^IP 


11 PRQ 




pRO^ni 




0 1Q7 


-1 4 4fiR 


10.993 




rr\\JH\J I 


pn 


-0 '^nfl 


.n OR 


10.625 




VAI ADO 


M 


A OVA 


-IP Pfi7 


9.78 




VAI Anp 


PA 


^ OOfl 


-IP IRT 


8.653 




VAI ADP 






-IP fl14 


8.998 


3060 


VAL402 


0 


7 008 


-12.485 


9.998 


3061 


VAL402 


CB 


5.194 


-10.723 


8.25 


3062 


VAL402 


CGI 


3.968 


-10.185 


7.523 


3063 


VAL402 


CG2 


5.553 


-9.84 


9.44 


3064 


THR403 


N 


6.772 


-13.729 


8.146 


3065 


THR403 


CA 


8.039 


-14.428 


8.342 


3066 


THR403 


c 


9.135 


-13.709 


7.571 


3067 


THR403 


0 


9.102 


-13.66 


6.335 


3068 


THR403 


CB 


7.888 


-15.853 


7.827 


3069 


THR403 


0G1 


6.715 


-16.403 


8.406 


3070 


THR403 


CG2 


9.077 


-16.723 


8.22 


3071 


TRP404 


N 


10.089 


-13.156 


8.298 


3072 


TRP404 


CA 


11.177 


-12.406 


7.66 


3073 


TRP404 


C 


12.136 


-13.344 


6.931 


3074 


TRP404 


0 


12.984 


-12.835 


6.21 


3075 


TRP404 


CB 


11.969 


-11.654 


8.719 


3076 


TRP404 


CG 


11.163 


-10.949 


9.79 
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o 



o 



3077 


TRP404 


CD1 


10.886 


-11.444 


11.043 


3078 


TRP404 


CD2 


10.559 


-9.637 


9.729 


3079 


TRP404 


NE1 


10,155 


-10.524 


11.721 


3080 


TRP404 


CE2 


9.943 


-9.428 


10.972 


3081 


TRP404 


CE3 


10.506 


-8.656 


8.749 


3082 


TRP404 


CZ2 


9.278 


-8.237 


11.225 


3083 


TRP404 


CZ3 


9.838 


-7.468 


9.009 


3084 


TRP404 


CH2 


9.226 


-7.257 


10.239 


3085 


TRP404 


OXT 


12.117 


-14.53 


7.239 


3086 


HEM1 


FE 


-8.08 


12.05 


10.226 


3087 


HEM1 


NA 


-9.653 


12.085 


9.078 


3088 


HEM1 


CIA 


-10.7 


13.004 


9.077 


3089 


HEM1 


C2A 


-11.687 


12.681 


8.118 


3090 


HEM1 


C3A 


-11.292 


11.525 


7.568 


3091 


HEM1 


C4A 


-10.019 


11.174 


8.129 


3092 


HEM1 


CHB 


-9.224 


10.115 


7.699 


3093 


HEM1 


C1B 


-7.931 


9.83 


8.181 


3094 


HEM1 


NB 


-7.308 


10.582 


9.182 


3095 


HEM1 


C4B 


-6.086 


9.964 


9.364 


3096 


HEM1 


C3B 


-5.946 


8.85 


8.506 


3097 


HEM1 


C2B 


-7.068 


8.771 


7.746 


3098 


HEM1 


CMB 


-7.416 


7.755 


6.682 


3099 


HEM1 


CAB 


-4.833 


8.031 


8.591 


3100 


HEM1 


CBB 


-4.44 


7.051 


7.74 


3101 


HEM1 


CHC 


-5.212 


10.298 


10.374 


3102 


HEM1 


C1G 


-5.439 


1 1 .223 


1 1 .336 




HEM1 


NC 


-6.519 


12.039 


1 1 .384 


3104 


HEM1 


C4C 


-6-227 


12.887 


12.426 


3105 


HEM1 


C3C 


-4.926 


12.636 


13.002 


3106 


HEM1 


C2C 


-4.491 


1 1 .556 


12.313 


3107 


HEM1 


CMC 


-3.265 


10.712 


12.532 


3108 


HEM1 


CAC 


-4.462 


13.435 


14.055 


3109 


HEM1 


CBC 


-3.452 


13.231 


14.936 


31 10 


HEM1 


CHD 


-7.061 


13.855 


12.91 


3111 


HEM1 


C1D 


-8.237 


14.203 


12.292 


3112 


HEM1 


ND 


-8.777 


13.572 


11.18 


3113 


HEM1 


C4D 


-9.915 


14.313 


10.916 


3114 


HEM1 


C3D 


-10.045 


15.413 


11.808 


3115 


HEM1 


C2D 


-9.006 


15.334 


12.673 


3116 


HEM1 


CMD 


-8.71 


16.241 


13.844 


3117 


HEM1 


CAD 


-11.178 


16.421 


11.802 


3118 


HEM1 


CBD 


-10.91 


17.624 


10.918 


3119 


HEM1 


CGD 


-12.079 


18.574 


10.862 


3120 


HEM1 


01 D 


-13.198 


18.167 


11.204 


3121 


HEM1 


02D 


-11.889 


19.736 


10.477 


3122 


HEM1 


CHA 


-10.849 


14.026 


9.961 


3123 


HEM1 


CMA 


-12.005 


10.703 


6.498 


3124 


HEM1 


CAA 


-12.907 


13.51 


7.748 


3125 


HEM1 


CBA 


-14.087 


13.112 


8.645 


3126 


HEM1 


CGA 


-15.442 


13.596 


8.14 


3127 


HEM1 


01A 


-15.522 


14.131 


7.009 


3128 


HEM1 


02A 


-16.439 


13.4 


8.866 
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5 What is Claimed is ; 

1. An isolated nucleic acid sequence encoding epothilone B hydroxylase 
or a mutant or variant thereof. 

10 2. The isolated nucleic acid sequence of claim 1 comprising SEQ E) NO: 

1, 30, 32, 34, 36, 37. 38, 39, 40, 41, 42, 60, 62, 64, 66, 68, 72 or 74. 

3. The isolated nucleic acid sequence of claim 1 comprising SEQ ID 

N0:1. 

15 

4. The isolated nucleic acid sequence of claim 1 encoding a mutant with 
at least one amino acid substitution in an active site of the epothilone B hydroxylase 
enzyme. 

20 5. The isolated nucleic acid sequence of claim 1 encoding a mutant with 

at least one amino acid substitution at amino acid GLU31, ARG67, ARG88, ILE92, 
ALA93, VAL106, ILE130, ALA140, MET176, PHE190, GLU 231, SER294, 
PHE237, or ILE365 of SEQ ID N0:2. 

25 6. The isolated nucleic acid sequence of claim 1 encoding a mutant with 

at least one amino acid substitution at amino acid LEU39, GLN43, ALA45, MET57, 
LEU58, HIS62, PHE63, SER64, SER65, ASP66, ARG67, GLN68, SER69, LEU74, 
MET75, VAL76, ALA77, ARG78, GLN79, ILE80, ASP84, LYS85, PR086, PHE87, 
ARG88, PR089, SER90. LEU91, ILE92, ALA93, MET94, ASP95, HIS99, ARG103. 

30 PHEl 10, ILE155, PHE169, GLN170, CYS172, SER173, SER174, ARG175, 
MET176, LEU177, SER178, ARG179. ARG186, PHE190, LEU193, VAL233, 
GLY234, LEU235, ALA236, PHE237, LEU238, LEU239, LEU240, ILE241, 
ALA242, GLY243. HIS244, GLU245, THR246, THR247, ALA248, ASN249. 
MET250, LEU283, THR287, ILE288, ALA289, GLU290, THR291, ALA292, 

35 THR293, SER294, ARG295, PHE296, ALA297, THR298, GLU3 12, GLY3 13, 
VAL314, VAL315. GLY316, VAL344, ALA345, PHE346, GLY347, PHE348, 
VAL350, HIS351, GLN352, CYS353, LEU354, GLY355, GLN356, LEU358, 
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5 ALA359, GLU362, LYS389, ASP391, SER392, THR393, ILE394, or TYR395 of 
SEQ ID N0:2. 

7. The isolated nucleic acid sequence of claim 1 encoding a variant 
comprising SEQ ID NO:43, 44, 45, 46, 47, 48 or 49. 

10 

8. A polypeptide encoded by the isolated nucleic acid sequence of claim 

1. 

9. An isolated nucleic acid molecule that is capable of hybridizing to a 
15 nucleic acid sequence of claim 2, or to the complementary sequence of said nucleic 

acid sequence, under hybridization conditions of 3X SSC at 65°C for 16 hours, said 
isolated nucleic acid molecule being capable of remaining hybridized to said nucleic 
acid sequence, or to the complementary sequence of said nucleic acid sequence, under 
wash conditions of 0.5X SSC, 55X for 30 minutes. 

20 

10. An isolated polypeptide comprising SEQ ID NO:2. 

11. An isolated mutant polypeptide of epothilone B hydroxylase of SEQ 
25 ID N0:2 comprising an amino acid sequence with at least one amino acid substitution 

in an active site of epothilone B hydroxylase enzyme of SEQ ID N0:2. 

12. An isolated mutant polypeptide of epothilone B hydroxylase of SEQ 
ID N0:2 comprising an amino acid sequence with at least one amino acid substitution 

30 at amino acid GLU31, ARG67, ARG88, ILE92, ALA93. VAL106, ILE130, 

ALA140, MET176, PHE190, GLU 231, SER294, PHE237, or ILE365 of SEQ ID 
N0:2. 

13. An isolated mutant polypeptide of epothilone B hydroxylase of SEQ 
35 ID N0:2 comprising an amino acid sequence with at least one amino acid substitution 

at amino acid LEU39, GLN43, ALA45, MET57, LEU58, HIS62, PHE63, SER64, 
SER65, ASP66, ARG67, GLN68, SER69, LEU74, MET75, VAL76, ALA77, 
ARG78, GLN79, ILE80, ASP84, LYS85, PR086, PHE87, ARG88, PR089, SER90, 
LEU91, ILE92, ALA93, MET94, ASP95, HIS99. ARG103, PHEllO, ILE155, 
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5 PHEI69, GLN170, CYS172, SER173, SER174, ARG175, MET176, LEU177, 
SER178, ARG179, ARG186, PHE190, LEU193, VAL233, GLY234, LEU235, 
ALA236, PHE237, LEU238, LEU239, LEU240, ILE241, ALA242, GLY243, 
HIS244, GLU245, THR245, THR247, ALA248, ASN249, MET250, LEU283, 
THR287, ILE288, ALA289, GLU290, THR291, ALA292, THR293, SER294, 
10 ARG295, PHE296, ALA297, THR298, GLU312, GLY313. VAL314, VAL315, 
GLY316, VAL344, ALA345, PHE346, GLY347, PHE348, VAL350. HIS351, 
GLN352, CYS353, LEU354, GLY355, GLN356, LEU358, ALA359, GLU362, 
LYS389, ASP391, SER392, THR393, ILE394, or TYR395 of SEQ ID N0:2. 

15 14. An isolated mutant polypeptide of epothilone B hydroxylase 

comprising SEQ ID NO: 31, 33, 35, 61, 63, 65, 67, 69, 71, 73 or 75. 

15. An isolated variant polypeptide of epothilone B hydroxylase 
comprising SEQ ID NO: 43, 44, 45, 46, 47, 48 or 49. 

20 

16. An isolated nucleic acid sequence encoding a fenedoxin. 

1 7. The isolated nucleic acid sequence of claim 1 6 comprising SEQ ID 

N0:3. 

25 

18. A polypeptide encoded by the isolated nucleic acid sequence of claim 

16. 

19. An isolated nucleic acid molecule that is capable of hybridizing to the 
30 nucleic acid sequence set forth in SEQ ID N0:3, or to the complementary sequence of 

the nucleic acid sequence set forth in SEQ ID N0:3, under hybridization conditions of 
3X SSC at 65°C for 16 hours, said isolated nucleic acid molecule being capable of 
remaining hybridized to the nucleic acid sequence set forth in SEQ ID N0:3, or to the 
complementary sequence of the nucleic acid sequence set forth in SEQ ID N0:3, 
35 under wash conditions of 0.5X SSC, 55°C for 30 minutes. 

20. A vector comprising the isolated nucleic acid sequence of claim 1. 

21. The vector of claim 20 further comprising an isolated nucleic acid 
40 sequence encoding a ferredoxin. 
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22. A host cell comprising the vector of claim 20. 



23. A host cell comprising the vector of claim 21. 



10 24. A method for producing recombinant microorganisms which 

hydroxylate epothilones having a terminal alkyl group to produce epothilones having 
a terminal hydroxyalkyl group, said method comprising transfecting a microorganism 
with the vector of claim 20 or 21 . 



15 25. A recombinantly produced microorganism that hydroxylates 

epothilones having a terminal alkyl group to produce epothilones having a terminal 
hydroxyalkyl group. 



26. The recombinantly produced microorganism of claim 25 wherein said 
20 microorganism expresses a nucleic acid sequence of SEQ ID NO: 1, 30, 32, 34, 36, 
37, 38, 39, 40, 41, 42, 60, 62, 64, 66, 68, 72 or 74. 



27. A method for the preparation of at least one epothilone of the 
following formula I 
25 HO-CH2-(A,)„-(Q)n,-(A2)o-E (I) 

where 

Ai and A2 are independently selected from the group of optionally substituted C1-C3 
alkyl and alkenyl; 

Q is an optionally substituted ring system containing one to three rings and at least 
30 one carbon to carbon double bond in at least one ring; 

n. m, and 0 are integers selected from the group consisting of zero and 1, where at 
least one of m or n or o is 1; and 
E is an epothilone core; 

comprising the steps of contacting at least one epothilone of the following formula D 

35 CH3-(Ai)n-(Q)m-(A2)o-E (II) 

where Ai, Q, A2, E. n, m, and o are defined as above; 
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5 with a recombinantly produced microorganism, or an enzyme derived therefrom, 
which is capable of selectively catalyzing the hydroxylation of Formula II. and 
effecting said hydroxylation. 

28. A method for the preparation of an epothilone analog of Formula A 




O OH O 

said method comprising biotransforming epothilone B to the epothilone analog of 
Formula A by incubation with a mutant epothilone B hydroxylase enzyme comprising 
SEQIDN0:31. 

29. A compound of Formula A 




0 OH O 

or a pharmaceutically acceptable salt thereof. 

30. A homology model of epothilone B hydroxylase having a root mean 
square deviation of conserved residue backbone atoms of less than about 4.0 A when 
superimposed on a corresponding backbone atoms described by structure coordinates 
listed in Appendix 1. 
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5 3 1 . A method for producing a mutant with altered biological properties, 

function, yield of a desired product, rate of reaction, substrate specificity, or activity 
as compared to epothilone B hydroxylase, said method comprising the steps of: 
identifying an amino acid of SEQ ID N0:2 to mutate; and mutating the identified 
amino acid to create a mutant protein. 

10 

32. The method of claim 3 1 wherein a homology model of epothilone B 
hydroxylase having a root mean square deviation of conserved residue backbone 
atoms of less than about 4.0 A when superimposed on a corresponding backbone 
atoms described by structure coordinates listed in Appendix 1 is used to identify an 

15 amino acid of SEQ ID NO: 2 to mutate. 

33. The method of claim 3 1 wherein the identified amino acid is LEU39, 
GLN43, ALA45, MET57, LEU58, HIS62, PHE63, SER64, SER65, ASP66, ARG67, 
GLN68, SER69, LEU74, MET75, VAL76, ALA77, ARG78, GLN79, ILE80, ASP84, 

20 LYS85, PR086, PHE87, ARG88, PR089, SER90, LEU91, ILE92, ALA93, MET94, 
ASP95, fflS99, ARG103, PHEllO, ILEI55, PHE169, GLN170, CYS172, SER173, 
SER174, ARG175, MET176, LEU177, SER178, ARG179, ARG186, PHE190, 
LEU193, VAL233, GLY234, LEU235, ALA236, PHE237, LEU238, LEU239, 
LEU240, E.E241, ALA242, GLY243, HIS244, GLU245, THR246, THR247, 

25 ALA248, ASN249, MET250, LEU283, THR287, ILE288, ALA289, GLU290, 
THR291, ALA292, THR293, SER294, ARG295, PHE296, ALA297, THR298, 
GLU3I2, GLY313, VAL314, VAL315, GLY316, VAL344, ALA345, PHE346, 
GLY347, PHE348, VAL350, HIS351, GLN352, CYS353, LEU354, GLY355, 
GLN356, LEU358, ALA359, GLU362, LYS389, ASP391. SER392. THR393, 

30 ILE394, or TYR395 of SEQ ID N0:2. 

34. The method of claim 31 wherein the identified amino acid is GLU31, 
ARG67, ARG88, ILE92, ALA93, VAL106, ILE130, ALAMO, MET176, PHE190, 
GLU 23 1 , SER294, PHE237, or ILE365 of SEQ ID N0:2. 

35 
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5 35. The method of claim 31 wherein the mutant protein improves yield of 

a desired product as compared to the yield of a desired product obtained using 
epothilone B hydroxylase. 

36. The method of claim 35 wherein the desired product is epothilone F. 

10 

37. The method of claim 3 1 wherein the mutant improves the rate of 
reaction as compared to the rate of reaction using epothilone B hydroxylase. 

38. The method of claim 31 wherein the mutant exhibits altered substrate 
15 specificity as compared to substrate specificity of epothilone B hydroxylase. 

39. The method of claim 38 wherein amino acid SER294 is mutated. 

40. The method of claim 3 1 wherein the mutant exhibits essentially 
20 similar biological activity or function to epothilone B hydroxylase. 

41. A machine-readable data storage medium comprising a data storage 
material encoded with structure coordinates set forth in Appendix 1. 
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epothilone B 



epothilone F 
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10 



15 



20 



Figure 2. Nucleotide sequence alignments used to design the PGR Primers: 
Alignment used to design primers P450-l"^and P450-la'*' 



STMSUACB 
N0:5) 
STMSUBCB 
NO: 6) 
3702259 
NO: 7) 
SSU65940 
NO: 8) 
STMOLEP 
NO: 9) 
SERCP450A 
NO:10) 



Alignment used to design primers P450-2'' and P450-2' 



tcctcatcgccggccacgagac 


(SEQ 


ID 


tgctggtcgccggccacgagac 


(SEQ 


ID 


tgctcatcaccggccaggacac 


(SEQ 


ID 


- -ctgttcgccgggcacgactc 


(SEQ 


ID 


tgctcatcgcgggccacgagac 


(SEQ 


ID 


tgctggtcgccgggcacgagac 


(SEQ 


ID 



25 


STMSUACB 
N0:11) 


cggcgcggtggaggaactgct 


(SEQ 


ID 




STMSUBCB 
NO: 12) 


gggcgccgtcgaggagctgct 


(SEQ 


ID 


30 


3702259 
NO: 13) 


ccgcaccctggaggagctgct 


(SEQ 


ID 




SSU65940 
NO: 14) 


C99cgcggtcgaggagatgct 


(SEQ 


ID 




STMOLEP 
NO: 15) 


cgcggcggtggaggagatgct 


(SEQ 


ID 


35 


SERCP450A 
NO: 16) 


cggcgcgatcgaggagaccct 


(SEQ 


ID 



40 



45 



50 



Alignment 

STMSUACB 

N0:17) 

STMSUBCB 

N0:18) 

3702259 

NO: 19) 

SSU65940 

N0:2G) 

STMOLEP 

N0:21) 

SERCP4 50A 

N0:22) 



used to design primer P450-3' 

ttcggcttcggcgtgcaccagtgcctgggc (SEQ ID 

ttcggcttcggcgtccaccagtgcctggga (SEQ ID 

ttcggctggggcccccaccactgcctgggc (SEQ ID 

ttcggtcacggcgtccacaagtgtcctggc (SEQ ID 

ttcgggcacggagcgcaccactgcatcggc (SEQ ID 

ttcggccacggcatccacttctgcgtgggc (SEQ ID 
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5 Figure 3: Sequence alignment between Epo-B hydroxylase 
and EryF (PDB code IJIN chain A) 

EPO-B MTDVEETTATLPLARKCPFSPP- - PEYERLRRESPVSRVGLPSGQTAWALTRLEDIREML 

IJINA ATVPDLESDSFHVDWYSTYAELRETAPVTPVRFL-GQDAWLVTGYDEAKAAL 

10 ...* .*.**.:**:*: **★*.* ::: * 

EPO-B SSPHFSSD--RQSPSFPLMVARQI--RREDKP-PRPSLIAMDPPEHGKARRDWGEFTVK 
IJINA SDLRLSSDPKKKYPGVEVEFPAYLGFPEDVRNYFATNMGTSDPPTHTRLRKLVSQEFTVR 
*. ::*** ::*.. : .. : .: : ;****:*;* **★*. 

15 

EPO-B RMKALQPRIQQIVDEHIDALLAGPKPADLVQALSLPVPSLVICELLGVPYSDHEFFQSCS 
IJINA RVEAMRPRVEQITAELLDEV-GDSGWDIVDRFAHPLPIKVICELLGVDEAARGAFGRWS 

4-*^ .* . .. * I * ******** . * * 

20 EPO-B SRMLSREVT-AEERMTAFESLENYLDELVTKKEANATEDDLLGRQILKQRESGEADHGEL 
IJINA SEILVMDPERAEQRGQAAREWNFILDLVERRRTEPGDDLLSALISVQDDDDGRLSADEL 
: **:* * :** :* * , ;;: :.*. . .** 

EPO-B VGLAFLLLIAGHETTANMISLGTVTLLENPDQLAKIKADPGKTLAAIEELLRIFTIAETA 
25 IJINA TSIALVLLLAGFEASVSLIGIGTYLLLTHPDQLALVRADPSALPNAVEEILRYIAPPETT 
^ .*..**.***...*_, ** ** .*★*** ..***^ *.**.** .**: 

EPO-B TSRFATADVEIGGTLIRAGEGWGLSNAGNHDPDGFENPDTFDIERGARHHVAFGFGVHQ 
IJINA T-RFAAEEVEIGGVAIPQYSTVLVANGAANRDPSQFPDPHRFDVTRDTRGHLSFGQGIHF 

30 ****;.****** *. **.♦**.***.*.**..***.* 

EPO-B CLGQNLARLELQIVFDTLFRRVPGIRIAVPVDELPFKHDSTIYGLHALPVTW-- 
IJINA CMGRPLAKLEGEVALRALFGRFPALSLGIDADDWWRRSLLLRGIDHLPVRLDG 
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